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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 

THEREOF 
FIELD OF INVENTION 
The present invention relates to Chlamydia antigens and corresponding DNA 
molecules, which can be used in methods to prevent and treat Chlamydia infection in mammals, 
such as humans. 

BACKGROUND OF THE INVENTION 

Chlamydiae are prokaryotes. They exhibit morphologic and structural similarities to 
gram-negative bacteria including a trilaminar outer membrane, which contains 
lipopolysaccharide and several membrane proteins that are structurally and functionally 
analogous to proteins found in E colL They are obligate intra-celhilar parasites with a unique 
biphasic life cycle consisting of a metabolically inactive but infectious extracellular stage and a 
replicating but non-infectious intracellular stage. The replicative stage of the life-cycle takes 
place within a membrane-bound inclusion which sequesters the bacteria away from the 
cytoplasm of the infected host cell. 

C pneumoniae is a common human pathogen, originally described as the TWAR strain 
of Chlamydia psittaci but subsequently recognised to be a new species. C. pneumoniae is 
antigenically, genetically and morphologically distinct from other chlamydia species (C. 
trachomatis, C pecorum and C psittaci). It shows 10% or less DNA sequence homology with 
either of C. trachomatis or Cpsittaci and so far appears to consist of only a single strain, TWAR. 

C pneumoniae is a common cause of community acquired pneumonia, only less frequent 
than Streptococcus pneumoniae and Mycoplasma pneumoniae (Ref 1,2). It can also cause upper 
respiratory tract symptoms and disease, including bronchitis and sinusitis (Ref 1,3,4,5). The 
great majority of the adult population (over 60%) has antibodies to C. pneumoniae (Ref 5), 
indicating past infection which was unrecognized or asymptomatic. 

Of considerable importance is the association of atherosclerosis and C pneumoniae 
infection. There are several epidemiological studies showing a correlation of previous infections 
with C. pneumoniae and heart attacks, coronary artery and carotid artery disease (Ref 6-10). 
Moreover, the organisms has been detected in atheromas and fatty streaks of the coronary, 
carotid, peripheral arteries and aorta (Ref 11-15). Viable C pneumoniae has been recovered 
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from the coronary and carotid artery (Ref 16,17). Furthermore, it has been shown that C 
pneumoniae can induce changes of atherosclerosis in a rabbit model (Ref 18). Taken together, 
these results indicate that it is highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial atherosclerosis remains to be 
demonstrated. 

A number of recent studies have also indicated an association between C. pneumoniae 
infection and asthma. Infection has been linked to wheezing, asthmatic bronchitis, adult-onset 
asthma and acute exacerbations of asthma in adults, and small-scale studies have shown that 
prolonged antibiotic treatment was effective at greatly reducing the severity of the disease in 
some individuals (Ref 19-24). 

In light of these results a protective vaccine against C. pneumoniae infection would be of 
considerable importance. There is not yet an effective vaccine for any human chlamydial 
infection. Nevertheless, studies with C. trachomatis and C psittaci indicate that this is an 
attainable goal. For example, mice which have recovered from a lung infection with C 
trachomatis are protected from infertility induced by a subsequent vaginal challenge (Ref 25). 
Similarly, sheep immunized with inactivated C psittaci were protected from subsequent 
chlamydial-induced abortions and stillbirths (Ref 26). Protection from chlamydial infections has 
been associated with Thl immune responses, particularly the induction of INFg - producing 
CD4+T-cells (Ref 27). The adoptive transfer of CD4+ cell lines or clones to nude or SCID mice 
conferred protection from challenge or cleared chronic disease (Ref 28,29), and in vivo 
depletion of CD4+ T cells exacerbated disease post-challenge (Ref 30,31). However, the 
presence of sufficiently high titres of neutralising antibody at mucosal surfaces can also exert a 
protective effect (Ref 32). 

The extent of antigenic variation within the species C pneumoniae is not well 
characterised. Serovars of C. trachomatis are defined on the basis of antigenic variation in 
MOMP, but published C. pneumoniae MOMP gene sequences show no variation between 
several diverse isolates of the organism (Ref 33-35). Regions of the protein known to be 
conserved in other chlamydial MOMPs are conserved in C pneumoniae (Ref 33,34). One study 
has described a strain of C pneumoniae with a MOMP of greater that usual molecular weight, 
but the gene for this has not been sequenced (Ref 1). Partial sequences of outer membrane 
protein 2 from nine diverse isolates were also found to be invariant (Ref 16). The genes for 
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HSP60 and HSP70 show little variation from other chlamydial species, as would be expected. 
The gene encoding a 76kDa antigen has been cloned from a single strain of C. pneumoniae. It 
has no significant similarity with other known chlamydial genes (Ref 4). 

Many antigens recognised by immune sera to C. pneumoniae are conserved across all 
chlamydias but 98kDa, 76 kDa and 54 kDa proteins may be C. /wew/wo*/ae-specific (Ref 2, 4, 
36). Immunoblotting of isolates with sera from patients does show variation of blotting patterns 
between isolates, indicating that serotypes C. pneumoniae may exist (Ref 1,16). However, the 
results are potentially confounded by the infection status of the patients, since immunoblot 
profiles of a patient's sera change with time post-infection. An assessment of the number and 
relative frequency of any serotypes, and the defining antigens, is not yet possible. 

C pneumoniae infection usually presents as an acute respiratory disease (i.e., cough, sore 
throat, hoarseness, and fever; abnormal chest sounds on auscultation). For most patients, the 
cough persists for 2 to 6 weeks, and recovery is slow. In approximately 10% of these cases, 
upper respiratory tract infection is followed by bronchitis or pneumonia. Furthermore, during a 
C. pneumoniae epidemic, subsequent co-infection with pneumococcus has been noted in about 
half of these pneumonia patients, particularly in the infirm and the elderly. As noted above, 
there is more and more evidence that C pneumoniae infection is also linked to diseases other 
than respiratory infections. 

The reservoir for the organism is presumably people. In contrast to C. psittaci infections, 
there is no known bird or animal reservoir. Transmission has not been clearly defined. It may 
result from direct contact with secretions, from formites, or from airborne spread. There is a long 
incubation period, which may last for many months. Based on analysis of epidemics, C 
pneumoniae appears to spread slowly through a population (case-to-case interval averaging 30 
days) because infected persons are inefficient transmitters of the organism. Susceptibility to C 
pneumoniae is universal. Reinfections occur during adulthood, following the primary infection as 
a child. C. pneumoniae appears to be an endemic disease throughout the world, noteworthy for 
superimposed intervals of increased incidence (epidemics) that persist for 2 to 3 years. 
C. trachomatis infection does not confer cross-immunity to C. pneumoniae. Infections are easily 
treated with oral antibiotics, tetracycline or erythromycin (2 g/d, for at least 10 to 14 d). A 
recently developed drug, azithromycin, is highly effective as a single-dose therapy against 
chlamydial infections. 
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In most instances, C pneumoniae infection is often mild and without complications, and 
up to 90% of infections are subacute or unrecognized. Among children in industrialized 
countries, infections have been thought to be rare up to the age of 5 y, although a recent study (E 
Nermann et al, Chlamydia pneumoniae in children with acute respiratory tract infections, Acta 
Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported that many children in this age group 
show PGR evidence of infection despite being seronegative, and estimates a prevalence of 17- 
19% in 2-4 y olds. In developing countries, the seroprevalence of C. pneumoniae antibodies 
among young children is elevated, and there are suspicions that C pneumoniae may be an 
important cause of acute lower respiratory tract disease and mortality for infants and children in 
tropical regions of the world. 

From seroprevalence studies and studies of local epidemics, the initial C pneumoniae 
infection usually happens between the ages of 5 and 20 y. In the USA, for example, there are 
estimated to be 30,000 cases of childhood pneumonia each year caused by C pneumoniae. 
Infections may cluster among groups of children or young adults (e.g., school pupils or military 
conscripts). 

C. pneumoniae causes 10 to 25% of community-acquired lower respiratory tract infections 
(as reported from Sweden, Italy, Finland, and the USA). During an epidemic, C. pneumonia 
infection may account for 50 to 60% of the cases of pneumonia. During these periods, also, more 
episodes of mixed infections with SL pneumoniae have been reported. 

Reinfection during adulthood is common; the clinical presentation tends to be milder. Based on 
population seroprevalence studies, there tends to be increased exposure with age, which is 
particularly evident among men. Some investigators have speculated that a persistent, 
asymptomatic C. pneumoniae infection state is common. 

In adults of middle age or older, C pneumoniae infection may progress to chronic 
bronchitis and sinusitis. A study in the USA revealed that the incidence of pneumonia caused by 
C. pneumoniae in persons younger than 60 years is 1 case per 1 ,000 persons per year; but in the 
elderly, the disease incidence rose three-fold. C pneumoniae infection rarely leads to 
hospitalization, except in patients with an underlying illness. 

SUMMARY OF THE INVENTION 
The present invention provides purified and isolated DNA molecules that encode 
Chlamydia polypeptides designated CPN 100635 (SEQ ID No: 1,2), which can be used in 
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methods to prevent, treat, and diagnose Chlamydia infection. The encoded polypeptides include 
polypeptides having the amino acid sequence shown in SEQ ID No:3 and 4. Those skilled in the 
art will appreciate that the invention also includes DNA molecules that encode mutants and 
derivatives of such polypeptides, which result from the addition, deletion, or substitution of non- 
essential amino acids as described herein. The invention also includes RNA molecules 
corresponding to the DNA molecules of the invention. 

In addition to the DNA and RNA molecules, the invention includes the corresponding 
polypeptides and monospecific antibodies that specifically bind to such polypeptides. 

The present invention has wide application and includes expression cassettes, vectors, 
and cells transformed or transfected with the polynucleotides of the invention. Accordingly, the 
present invention provides (i) a method for producing a polypeptide of the invention in a 
recombinant host system and related expression cassettes, vectors, and transformed or transfected 
cells; (ii) a live vaccine vector, such as a pox virus, Salmonella typhimurium, or Vibrio cholerae 
vector, containing a polynucleotide of the invention, such vaccine vectors being useful for, e.g. y 
preventing and treating Chlamydia infection, in combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or prophylactic methods; (iii) a 
therapeutic and/or prophylactic method involving administration of an RNA or DNA molecule of 
the invention, either in a naked form or formulated with a delivery vehicle, a polypeptide or 
combination of polypeptides, or a monospecific antibody of the invention, and related 
pharmaceutical compositions; (iv) a method for diagnosing the presence of Chlamydia in a 
biological sample, which can involve the use of a DNA or RNA molecule, a monospecific 
antibody, or a polypeptide of the invention; and (v) a method for purifying a polypeptide of the 
invention by antibody-based affinity chromatography. 

BRIEF DESCRIPTION OF THE DRAWINGS 
The present invention will be further understood from the following description with 
reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN 100635 (SEQ ID No: 1 - entire 
sequence and SEQ ID No: 2 - coding sequence) and the deduced amino acid sequence of the 
CPN1 00635 protein from Chlamydia pneumoniae (SEQ ID No: 3 - unprocessed and 4 - 
processed). 
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Figure 2 shows the restriction enzyme analysis of the gene encoding the C. 
pneumoniae CPN1 00635 gene. 

DETAILED DESCRIPTION OF INVENTION 

In the C pneumoniae genome, open reading frames (ORFs) encoding chlamydial 
polypeptides have been identified These polypeptides include polypeptides permanently found 
in the bacterial membrane structure, polypeptides that are present in the external vicinity of the 
bacterial membrane, include polypeptides permanently found in the inclusion membrane 
structure, polypeptides that are present in the external vicinity of the inclusion membrane, and 
polypeptides that are released into the cytoplasm of the infected cell. These polypeptides can be 
used in vaccination methods for preventing and treating Chlamydia infection. 

According to a first aspect of the invention, there are provided isolated 
polynucleotides encoding the precursor and mature forms of Chlamydia polypeptides. 

An isolated polynucleotide of the invention encodes (i) a polypeptide having an 
amino acid sequence that is homologous to a Chlamydia amino acid, the Chlamydia amino acid 
sequence being selected from the group consisting of: 

(a) the amino acid sequences as shown: (SEQ ID No: 3 and 4) 

The term "isolated polynucleotide" is defined as a polynucleotide removed from the 
environment in which it naturally occurs. For example, a naturally-occurring DNA molecule 
present in the genome of a living bacteria or as part of a gene bank is not isolated, but the same 
molecule separated from the remaining part of the bacterial genome, as a result of, e.g., a cloning 
event (amplification), is isolated. Typically, an isolated DNA molecule is free from DNA 
regions (e.g., coding regions) with which it is immediately contiguous at the 5' or 3 r end, in the 
naturally occurring genome. Such isolated polynucleotides could be part of a vector or a 
composition and still be isolated in that such a vector or composition is not part of its natural 
environment. 

A polynucleotide of the invention can be in the form of RNA or DNA (e.g., cDNA, 
genomic DNA, or synthetic DNA), or modifications or combinations thereof. The DNA can be 
double-stranded or single-stranded, and, if single-stranded, can be the coding strand or the non- 
coding (anti-sense) strand. The sequence that encodes a polypeptide of the invention as shown in 
SEQ ID NOs: 1 and 2, can be (a) the coding sequence as shown in SEQ ID NOs:2 (b) a 
ribonucleotide sequence derived by transcription of (a) ; or (c) a different coding sequence; this 
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latter, as a result of the redundancy or degeneracy of the genetic code, encodes the same 
polypeptides as the DNA molecules of which the nucleotide sequences are illustrated in SEQ ID 
NOs:l to 2. 

By "polypeptide" or "protein" is meant any chain of amino acids, regardless of length 
or post-translational modification (e.g., glycosylation or phosphorylation). Both terms are used 
interchangeably in the present application. 

By "homologous amino acid sequence" is meant an amino acid sequence that differs 
from an amino acid sequence shown in SEQ ID No: 3 or 4, only by one or more conservative 
amino acid substitutions, or by one or more non-conservative amino acid substitutions, deletions, 
or additions located at positions at which they do not destroy the specific antigenicity of the 
polypeptide. 

Preferably, such a sequence is at least 75%, more preferably 80%, and most 
preferably 90% identical to an amino acid sequence shown in SEQ ID No: 3 or 4. 

Homologous amino acid sequences include sequences that are identical or 
substantially identical to an amino acid sequence as shown in SEQ ID No: 3 or 4. By "amino 
acid sequence substantially identical" is meant a sequence that is at least 90%, preferably 95%, 
more preferably 97%, and most preferably 99% identical to an amino acid sequence of reference 
and that preferably differs from the sequence of reference, if at all, by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions typically include substitutions among amino 
acids of the same class. These classes include, for example, amino acids having uncharged polar 
side chains, such as asparagine, glutamine, serine, threonine, and tyrosine; amino acids having 
basic side chains, such as lysine, arginine, and histidine; amino acids having acidic side chains, 
such as aspartic acid and glutamic acid; and amino acids having nonpolar side chains, such as 
glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine. 

Homology is typically measured using sequence analysis software (e.g., Sequence 
Analysis Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Similar amino acid 
sequences are aligned to obtain the maximum degree of homology {i.e., identity). To this end, it 
may be necessary to artificially introduce gaps into the sequence. Once the optimal alignment 
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has been set up, the degree of homology (i.e., identity) is established by recording all of the 
positions in which the amino acids of both sequences are identical, relative to the total number of 
positions. 

Homologous polynucleotide sequences are defined in a similar way. Preferably, a 
homologous sequence is one that is at least 45%, more preferably 60%, and most preferably 85% 
identical to (i) a coding sequence of SEQ ID NOs:l and 2. 

Polypeptides having a sequence homologous to one of the sequences shown in SEQ 
ID NO: 3 or 4 , include naturally-occurring allelic variants, as well as mutants or any other non- 
naturally occurring variants that are analogous in terms of antigenicity, to a polypeptide having a 
sequence as shown in SEQ ID NO: 3 or 4. 

As is known in the art, an allelic variant is an alternate form of a polypeptide that is 
characterized as having a substitution, deletion, or addition of one or more amino acids that does 
not alter the biological function of the polypeptide. By "biological function" is meant the 
function of the polypeptide in the cells in which it naturally occurs, even if the function is not 
necessary for the growth or survival of the cells. For example, the biological function of a porin 
is to allow the entry into cells of compounds present in the extracellular medium. The biological 
function is distinct from the antigenic function. A polypeptide can have more than one 
biological function. 

Allelic variants are very common in nature. For example, a bacterial species, e.g.,C. 
pneumoniae, is usually represented by a variety of strains that differ from each other by minor 
allelic variations. Indeed, a polypeptide that fulfills the same biological function in different 
strains can have an amino acid sequence that is not identical in each of the strains. Such an 
allelic variation may be equally reflected at the polynucleotide level. 

Support for the use of allelic variants of polypeptide antigens comes from, e.g., 
studies of the Chlamydial MOMP antigen. The amino acid sequence of the MOMP varies from 
strain to strain, yet cross-strain antibody binding plus neutralization of infectivity occurs, 
indicating that the MOMP, when used as an immunogen, is tolerant of amino acid variations. 

Polynucleotides, e.g., DNA molecules, encoding allelic variants can easily be 
retrieved by polymerase chain reaction (PCR) amplification of genomic bacterial DNA extracted 
by conventional methods. This involves the use of synthetic oligonucleotide primers matching 
upstream and downstream of the 5* and 3' ends of the encoding domain. Suitable primers can be 
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designed according to the nucleotide sequence information provided in SEQ ID NOs:l and 2. 
Typically, a primer can consist of 10 to 40, preferably 15 to 25 nucleotides. It may be also 
advantageous to select primers containing C and G nucleotides in a proportion sufficient to 
ensure efficient hybridization; e.g., an amount of C and G nucleotides of at least 40%, preferably 
5 50% of the total nucleotide amount. 

Useful homologs that do not naturally occur can be designed using known methods 
for identifying regions of an antigen that are likely to be tolerant of amino acid sequence changes 
and/or deletions. For example, sequences of the antigen from different species can be compared 
to identify conserved sequences. 
10 Polypeptide derivatives that are encoded by polynucleotides of the invention include, 

e.g., fragments, polypeptides having large internal deletions derived from full-length 
^ polypeptides, and fusion proteins. 

Q Polypeptide fragments of the invention can be derived from a polypeptide having a 

i=k sequence homologous to any of the sequences shown in SEQ ID NO: 3 or 4 , to the extent that 

4? 15 the fragments retain the substantial antigenicity of the parent polypeptide (specific antigenicity). 
%i Polypeptide derivatives can also be constructed by large internal deletions that remove a 

5 substantial part of the parent polypeptide, while retaining specific antigenicity. Generally, 

hj polypeptide derivatives should be about at least 12 amino acids in length to maintain 

j~f antigenicity. Advantageously, they can be at least 20 amino acids, preferably at least 50 amino 

20 acids, more preferably at least 75 amino acids, and most preferably at least 100 amino acids in 
length. 

Useful polypeptide derivatives, e.g., polypeptide fragments, can be designed using 
computer-assisted analysis of amino acid sequences in order to identify sites in protein antigens 
having potential as surface-exposed, antigenic regions (Ref 37). 
25 Polypeptide fragments and polypeptides having large internal deletions can be used 

for revealing epitopes that are otherwise masked in the parent polypeptide and that may be of 
importance for inducing a protective T cell-dependent immune response. Deletions can also 
remove immunodominant regions of high variability among strains. 

It is an accepted practice in the field of immunology to use fragments and variants of 
30 protein immunogens as vaccines, as all that is required to induce an immune response to a 
protein is a small (e.g., 8 to 10 amino acid) immunogenic region of the protein. This has been 



V4 



RY50 101998 

done for a number of vaccines against pathogens other than Chlamydia. For example, short 
synthetic peptides corresponding to surface-exposed antigens of pathogens such as murine 
mammary tumor virus, peptide containing 11 amino acids; (Ref 38), Semliki Forest virus, 
peptide containing 16 amino acids (Ref 39), and canine parvovirus, 2 overlapping peptides, each 
containing 1 5 amino acids (Ref 40), have been shown to be effective vaccine antigens against 
their respective pathogens. 

Polynucleotides encoding polypeptide fragments and polypeptides having large 
internal deletions can be constructed using standard methods (Ref 41), for example, by PCR, 
including inverse PCR, by restriction enzyme treatment of the cloned DNA molecules, or by the 
method of Kunkel et ah (Ref 42) biological material available at Stratagene. 

A polypeptide derivative can also be produced as a fusion polypeptide that contains a 
polypeptide or a polypeptide derivative of the invention fused, e.g. y at the N- or C-terminal end, 
to any other polypeptide (hereinafter referred to as a peptide tail). Such a product can be easily 
obtained by translation of a genetic fusion, i.e., a hybrid gene. Vectors for expressing fusion 
polypeptides are commercially available, such as the pMal-c2 or pMal-p2 systems of New 
England Biolabs, in which the peptide tail is a maltose binding protein, the glutathione-S- 
transferase system of Pharmacia, or the His-Tag system available from Novagen. These and 
other expression systems provide convenient means for further purification of polypeptides and 
derivatives of the invention. 

Another particular example of fusion polypeptides included in invention includes a 
polypeptide or polypeptide derivative of the invention fused to a polypeptide having adjuvant 
activity, such as, e.g., subunit B of either cholera toxin or E, coli heat-labile toxin. Several 
possibilities are can be used for achieving fusion. First, the polypeptide of the invention can be 
fused to the N-, or preferably, to the C-terminal end of the polypeptide having adjuvant activity. 
Second, a polypeptide fragment of the invention can be fused within the amino acid sequence of 
the polypeptide having adjuvant activity. 

As stated above, the polynucleotides of the invention encode Chlamydia polypeptides 
in precursor or mature form. They can also encode hybrid precursors containing heterologous 
signal peptides, which can mature into polypeptides of the invention. By "heterologous signal 
peptide" is meant a signal peptide that is not found in the naturally-occurring precursor of a 
polypeptide of the invention. 
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A polynucleotide of the invention, having a homologous coding sequence, hybridizes, 
preferably under stringent conditions, to a polynucleotide having a sequence as shown in SEQ ID 
NOs:l to 2. Hybridization procedures are, e.g., described in Ausubel et al., (Ref 41), Silhavy et 
al (Ref 43); Davis et al (ref 44). Important parameters that can be considered for optimizing 
hybridization conditions are reflected in a formula that allows calculation of a critical value, the 
melting temperature above which two complementary DNA strands separate from each other Ref 
45). This formula is as follows: Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion 
concentration) - 0.6 x (% formamide). Under appropriate stringency conditions, hybridization 
temperature (Th) is approximately 20 to 40°C, 20 to 25°C, or, preferably 30 to 40°C below the 
calculated Tm. Those skilled in the art will understand that optimal temperature and salt 
conditions can be readily determined empirically in preliminary experiments using conventional 
procedures. 

For example, stringent conditions can be achieved, both for pre-hybridizing and 
hybridizing incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing 50% formamide or 
(ii) within 4-16 hours at 65°C in an aqueous 6 x SSC solution (1 M NaCl, 0.1 M sodium citrate 
(pH 7.0)). 

For polynucleotides containing 30 to 600 nucleotides, the above formula is used and 
then is corrected by subtracting (600/polynucleotide size in base pairs). Stringency conditions 
are defined by a Th that is 5 to 10°C below Tm. 

Hybridization conditions with oligonucleotides shorter than 20-30 bases do not 
exactly follow the rules set forth above. In such cases, the formula for calculating the Tm is as 
follows: Tm = 4 x (G+C) + 2 (A+T). For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54°C. 

A polynucleotide molecule of the invention, containing RNA, DNA, or modifications 
or combinations thereof, can have various applications. For example, a DNA molecule can be 
used (i) in a process for producing the encoded polypeptide in a recombinant host system, (ii) in 
the construction of vaccine vectors such as poxviruses, which are further used in methods and 
compositions for preventing and/or treating Chlamydia infection, (iii) as a vaccine agent (as well 
as an RNA molecule), in a naked form or formulated with a delivery vehicle and, (iv) in the 
construction of attenuated Chlamydia strains that can over-express a polynucleotide of the 
invention or express it in a non-toxic, mutated form. 
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According to a second aspect of the invention, there is therefore provided (i) an 
expression cassette containing a DNA molecule of the invention placed under the control of the 
elements required for expression, in particular under the control of an appropriate promoter; (ii) 
an expression vector containing an expression cassette of the invention; (iii) a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, as well as (iv) a process for producing a polypeptide or polypeptide derivative 
encoded by a polynucleotide of the invention, which involves culturing a procaryotic or 
eucaryotic cell transformed or transfected with an expression cassette and/or vector of the 
invention, under conditions that allow expression of the DNA molecule of the invention and, 
recovering the encoded polypeptide or polypeptide derivative from the cell culture. 

A recombinant expression system can be selected from procaryotic and eucaryotic 
hosts. Eucaryotic hosts include yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris), 
mammalian cells (e.g., COS1, NIH3T3, or JEG3 cells), arthropods cells (e.g., Spodoptera 
frugiperda (SF9) cells), and plant cells. Preferably, a procaryotic host such as E. coli is used. 
Bacterial and eucaryotic cells are available from a number of different sources to those skilled in 
the art, e.g., the American Type Culture Collection (ATCC; Rockville, Maryland). 

The choice of the expression system depends on the features desired for the expressed 
polypeptide. For example, it may be useful to produce a polypeptide of the invention in a 
particular lipidated form or any other form. 

The choice of the expression cassette will depend on the host system selected as well 
as the features desired for the expressed polypeptide. Typically, an expression cassette includes 
a promoter that is functional in the selected host system and can be constitutive or inducible; a 
ribosome binding site; a start codon (ATG) if necessary, a region encoding a signal peptide, e.g., 
a lipidation signal peptide; a DNA molecule of the invention; a stop codon; and optionally a 3' 
terminal region (translation and/or transcription terminator). The signal peptide encoding region 
is adjacent to the polynucleotide of the invention and placed in proper reading frame. The signal 
peptide-encoding region can be homologous or heterologous to the DNA molecule encoding the 
mature polypeptide and can be specific to the secretion apparatus of the host used for expression. 
The open reading frame constituted by the DNA molecule of the invention, solely or together 
with the signal peptide, is placed under the control of the promoter so that transcription and 
translation occur in the host system. Promoters, signal peptide encoding regions are widely 
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known and available to those skilled in the art and includes, for example, the promoter of 
Salmonella typhimurium (and derivatives) that is inducible by arabinose (promoter araB) and is 
functional in Gram-negative bacteria such as £ coli (as described in U.S. Patent No. 5,028,530 
and in Cagnon et ah, (Ref 46); the promoter of the gene of bacteriophage T7 encoding RNA 
polymerase, that is functional in a number of E. coli strains expressing 17 polymerase (described 
in U.S. Patent No. 4,952,496); OspA lipidation signal peptide ; and RlpB lipidation signal 
peptide (Ref 47). 

The expression cassette is typically part of an expression vector, which is selected for 
its ability to replicate in the chosen expression system. Expression vectors (e.g., plasmids or 
viral vectors) can be chosen from those described in Pouwels et ah (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). They can be purchased from various commercial 
sources. 

Methods for transforming/transfecting host cells with expression vectors will depend 
on the host system selected as described in Ausubel et ah 9 (Ref 41). 

Upon expression, a recombinant polypeptide of the invention (or a polypeptide 
derivative) is produced and remains in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded in the cellular membrane. The 
polypeptide can then be recovered in a substantially purified form from the cell extract or from 
the supernatant after centrifugation of the recombinant cell culture. Typically, the recombinant 
polypeptide can be purified by antibody-based affinity purification or by any other method that 
can be readily adapted by a person skilled in the art, such as by genetic fusion to a small affinity 
binding domain. Antibody-based affinity purification methods are also available for purifying a 
polypeptide of the invention extracted from a Chlamydia strain. Antibodies useful for purifying 
by immunoaffinity the polypeptides of the invention can be obtained as described below. 

A polynucleotide of the invention can also be useful in the vaccine field, e.g.> for 
achieving DNA vaccination. There are two major possibilities, either using a viral or bacterial 
host as gene delivery vehicle (live vaccine vector) or administering the gene in a free form, e.g., 
inserted into a plasmid. Therapeutic or prophylactic efficacy of a polynucleotide of the invention 
can be evaluated as described below. 

Accordingly, in a third aspect of the invention, there is provided (i) a vaccine vector 
such as a poxvirus, containing a DNA molecule of the invention, placed under the control of 
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elements required for expression; (ii) a composition of matter containing a vaccine vector of the 
invention, together with a diluent or carrier; particularly, (iii) a pharmaceutical composition 
containing a therapeutically or prophylactically effective amount of a vaccine vector of the 
invention; (iv) a method for inducing an immune response against Chlamydia in a mammal (e.g., 
5 a human; alternatively, the method can be used in veterinary applications for treating or 
preventing Chlamydia infection of animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine vector of the invention to elicit 
an immune response, e.g., a protective or therapeutic immune response to Chlamydia ; and 
particularly, (v) a method for preventing and/or treating a Chlamydia (e.g., C. trachomatis, C 
10 psittaci, C. pneumonia, C. pecorum) infection, which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an individual in need. Additionally, 
the third aspect of the invention encompasses the use of a vaccine vector of the invention in the 
preparation of a medicament for preventing and/or treating Chlamydia infection. 

A vaccine vector of the invention can express one or several polypeptides or 
□ 15 derivatives of the invention, as well as at least one additional Chlamydia antigen, fragment, 

u.\ homolog, mutant, or derivative thereof. In addition, it can express a cytokine, such as 

S3 

~~ interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the immune response (adjuvant 

^ effect). Thus, a vaccine vector can include an additional DNA sequence encoding, e.g., a 

P.-J. 

Cf chlamydial antigen , or a cytokine, placed under the control of elements required for expression 

20 in a mammalian cell. 

A 1 * 

^ Alternatively, a composition of the invention can include several vaccine vectors, 

each of them being capable of expressing a polypeptide or derivative of the invention. A 
composition can also contain a vaccine vector capable of expressing an additional Chlamydia 
antigen, or a subunit, fragment, homolog, mutant, or derivative thereof; or a cytokine such as IL- 
25 2 or IL-12. 

In vaccination methods for treating or preventing infection in a mammal, a vaccine 
vector of the invention can be administered by any conventional route in use in the vaccine field, 
particularly, to a mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, 
vaginal, or urinary tract) surface or via the parenteral (e.g., subcutaneous, intradermal, 
30 intramuscular, intravenous, or intraperitoneal) route. Preferred routes depend upon the choice of 
the vaccine vector. The administration can be achieved in a single dose or repeated at intervals. 
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The appropriate dosage depends on various parameters understood by skilled artisans such as the 
vaccine vector itself, the route of administration or the condition of the mammal to be vaccinated 
(weight, age and the like). 

Live vaccine vectors available in the art include viral vectors such as adenoviruses 
5 and poxviruses as well as bacterial vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 
Lactobacillus, Bacille bilie de Calmette-Guerin (BCG), and Streptococcus. 

An example of an adenovirus vector, as well as a method for constructing an 
adenovirus vector capable of expressing a DNA molecule of the invention, are described in 
U.S. Patent No. 4,920,209. Poxvirus vectors that can be used include, e.g., vaccinia and canary 
10 pox virus, described in U.S. Patent No. 4,722,848 and U.S. Patent No. 5,364,773, respectively 
(also see, e.g., Tartaglia et aL, Virology (1992) 188:217) for a description of a vaccinia virus 
vector; and Taylor et al, Vaccine (1995) 13:539 for a reference of a canary pox). Poxvirus 
vectors capable of expressing a polynucleotide of the invention can be obtained by homologous 
recombination as described in Kieny et aL, Nature (1984) 312:163 so that the polynucleotide of 
Q 15 the invention is inserted in the viral genome under appropriate conditions for expression in 
yj mammalian cells. Generally, the dose of vaccine viral vector, for therapeutic or prophylactic use, 

can be of from about lxlO 4 to about lxlO 11 , advantageously from about lxlO 7 to about lxlO 10 , 
preferably of from about lxl 0 7 to about lxl 0 9 plaque-forming units per kilogram. Preferably, 
viral vectors are administered parenterally; for example, in 3 doses, 4 weeks apart. Those skilled 
| j 20 in the art recognize that it is preferable to avoid adding a chemical adjuvant to a composition 

in- 

containing a viral vector of the invention and thereby minimizing the immune response to the 
viral vector itself. 

Non-toxicogenic Vibrio cholerae mutant strains that are useful as a live oral vaccine 
are described in Mekalanos et aL, Nature (1983) 306:551 and U.S. Patent No. 4,882,278 (strain 

25 in which a substantial amount of the coding sequence of each of the two ctxA alleles has been 
deleted so that no functional cholerae toxin is produced); WO 92/1 1 354 (strain in which the irgA 
locus is inactivated by mutation; this mutation can be combined in a single strain with ctxA 
mutations); and WO 94/1533 (deletion mutant lacking functional ctxA and attRSl DNA 
sequences). These strains can be genetically engineered to express heterologous antigens, as 

30 described in WO 94/19482. An effective vaccine dose of a Vibrio cholerae strain capable of 
expressing a polypeptide or polypeptide derivative encoded by a DNA molecule of the invention 
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can contain, e.g., about 1x10 s to about lxlO 9 , preferably about lxlO 6 to about 1x10 s viable 
bacteria in an appropriate volume for the selected route of administration. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically engineered for recombinant 
expression of heterologous antigens or not, and their use as oral vaccines are described in 
Nakayamae/ ah (Bio/Technology (1988) 6:693) and WO 92/11361. Preferred routes of 
administration include all mucosal routes; most preferably, these vectors are administered 
intranasally or orally. 

Others bacterial strains useful as vaccine vectors are described in High et al. 9 EMBO 
(1992) 11:1991 and Sizemore et aL, Science (1995) 270:299 (Shigella Jlexneri); Medaglini et al. 9 
Proc. Natl. Acad. Sci. USA (1995) 92:6868 (Streptococcus gordonii); and Flynn J.L., Cell. Mol. 
Biol. (1994) 40 (suppl I):31, WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO 
92/21376 (Bacille Calmette Guerin). 

In bacterial vectors, polynucleotide of the invention can be inserted into the bacterial 
genome or can remain in a free state, carried on a plasmid. 

An adjuvant can also be added to a composition containing a vaccine bacterial vector. 
A number of adjuvants are known to those skilled in the art. Preferred adjuvants can be selected 
from the list provided below. 

According to a fourth aspect of the invention, there is also provided (i) a composition 
of matter containing a polynucleotide of the invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition containing a therapeutically or prophylactically effective amount of 
a polynucleotide of the invention; (iii) a method for inducing an immune response against 
Chlamydia, in a mammal, by administering to the mammal, an immunogenically efFective 
amount of a polynucleotide of the invention to elicit an immune response, e.g., a protective 
immune response to Chlamydia] and particularly, (iv) a method for preventing and/or treating a 
Chlamydia (e.g., C. trachomatis, C. psittaci, C pneumoniae, or C. pecorum) infection, by 
administering a prophylactic or therapeutic amount of a polynucleotide of the invention to an 
individual in need. Additionally, the fourth aspect of the invention encompasses the use of a 
polynucleotide of the invention in the preparation of a medicament for preventing and/or treating 
Chlamydia infection. The fourth aspect of the invention preferably includes the use of a DNA 
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molecule placed under conditions for expression in a mammalian cell, e.g., in a plasmid that is 
unable to replicate in mammalian cells and to substantially integrate in a mammalian genome. 

Polynucleotides (DNA or RNA) of the invention can also be administered as such to a 
mammal for vaccine, e.g., therapeutic or prophylactic, purpose. When a DNA molecule of the 
invention is used, it can be in the form of a plasmid that is unable to replicate in a mammalian 
cell and unable to integrate in the mammalian genome. Typically, a DNA molecule is placed 
under the control of a promoter suitable for expression in a mammalian cell. The promoter can 
function ubiquitously or tissue-specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. Patent No. 4,168,062) and the 
Rous Sarcoma Virus promoter (described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281). 
The desmin promoter (Li et al, Gene (1989) 78:243, Li & Paulin, J. Biol. Chem. (1991) 
266:6562 and Li & Paulin, J. Biol. Chem. (1993) 268:10403) is tissue-specific and drives 
expression in muscle cells. More generally, useful vectors are described, /.a., WO 94/21797 and 
Hartikka et aL y Human Gene Therapy (1996) 7:1205. 

For DNA/RNA vaccination, the polynucleotide of the invention can encode a 
precursor or a mature form. When it encodes a precursor form, the precursor form can be 
homologous or heterologous. In the latter case, a eucaryotic leader sequence can be used, such as 
the leader sequence of the tissue-type plasminogen factor (tPA). 

A composition of the invention can contain one or several polynucleotides of the 
invention. It can also contain at least one additional polynucleotide encoding another Chlamydia 
antigen such as urease subunit A, B, or both; or a fragment, derivative, mutant, or analog thereof. 
A polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or interleukin-12 (IL-12), can 
also be added to the composition so that the immune response is enhanced. These additional 
polynucleotides are placed under appropriate control for expression. Advantageously, DNA 
molecules of the invention and/or additional DNA molecules to be included in the same 
composition, can be carried in the same plasmid. 

Standard techniques of molecular biology for preparing and purifying polynucleotides 
can be used in the preparation of polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention can be formulated according to various methods. 

First, a polynucleotide can be used in a naked form, free of any delivery vehicles, 
such as anionic liposomes, cationic lipids, microparticles, e.g., gold microparticles, precipitating 
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agents, e.g., calcium phosphate, or any other transfection-facilitating agent. In this case, the 
polynucleotide can be simply diluted in a physiologically acceptable solution, such as sterile 
saline or sterile buffered saline, with or without a carrier. When present, the carrier preferably is 
isotonic, hypotonic, or weakly hypertonic, and has a relatively low ionic strength, such as 
5 provided by a sucrose solution, e.g., a solution containing 20% sucrose. 

Alternatively, a polynucleotide can be associated with agents that assist in cellular 
uptake. It can be, i.a. y (i) complemented with a chemical agent that modifies the cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) encapsulated into liposomes, or 
(iii) associated with cationic lipids or silica, gold, or tungsten microparticles. 
10 Anionic and neutral liposomes are well-known in the art (see, e.g., Liposomes: A 

Practical Approach, RPC New Ed, IRL press (1990), for a detailed description of methods for 
making liposomes) and are useful for delivering a large range of products, including 
C3 polynucleotides. j 

ftii 

Cationic lipids are also known in the art and are commonly used for gene delivery. 
J; 15 Such lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3^oleyloxy)propyl]-N,N,N- 
y trimethylammonium chloride), DOTAP (l^-bis(oleyloxy)-3-(trimethylammonio)propane), 

3 DDAB (dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl spermine) 

hj and cholesterol derivatives such as DC-Choi (3 beta-(N-(N',N'-dimethyl aminomethane)- 

^ carbamoyl) cholesterol). A description of these cationic lipids can be found in EP 187,702, 

20 WO 90/1 1092, U.S. Patent No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
No. 5,527,928. Cationic lipids for gene delivery are preferably used in association with a neutral 
lipid such as DOPE (dioleyl phosphatidylethanolamine), as, for example, described in 
WO 90/1 1092. 

Other transfection-facilitating compounds can be added to a formulation containing 
25 cationic liposomes. A number of them are described in, e.g., WO 93/18759, WO 93/19768, WO 
94/25608, and WO 95/2397. They include, /.a, spermine derivatives useful for facilitating the 
transport of DNA through the nuclear membrane (see, for example, WO 93/18759) and 
membrane-permeabilizing compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/1 9768). 
30 Gold or tungsten microparticles can also be used for gene delivery, as described in 

WO 91/359, WO 93/17706, and Tang*/ ah (Nature (1992) 356:152). In this case, the 
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microparticle-coated polynucleotides can be injected via intradermal or intraepidermal routes 
using a needleless injection device ("gene gun"), such as those described in U.S. Patent 
No. 4,945,050, U.S. Patent No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient depends, e.g., on the strength of 
the promoter used in the DNA construct, the immunogenicity of the expressed gene product, the 
condition of the mammal intended for administration (e.g., the weight, age, and general health of 
the mammal), the mode of administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 \ig to about 1 mg, preferably, 
from about 10 ng to about 800 ug and, more preferably, from about 25 |ag to about 250 jig, can 
be administered to human adults. The administration can be achieved in a single dose or 
repeated at intervals. 

The route of administration can be any conventional route used in the vaccine field. 
As general guidance, a polynucleotide of the invention can be administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, intestinal, rectal, vaginal, and urinary tract 
surface; or via a parenteral route, e.g., by an intravenous, subcutaneous, intraperitoneal, 
intradermal, intraepidermal, or intramuscular route. The choice of the administration route will 
depend on, e.g., the formulation that is selected. A polynucleotide formulated in association with 
bupivacaine is advantageously administered into muscles. When a neutral or anionic liposome or 
a cationic lipid, such as DOTMA or DC-Choi, is used, the formulation can be advantageously 
injected via intravenous, intranasal (aerosolization), intramuscular, intradermal, and 
subcutaneous routes. A polynucleotide in a naked form can advantageously be administered via 
the intramuscular, intradermal, or sub-cutaneous routes. 

Although not absolutely required, such a composition can also contain an adjuvant. If 
so, a systemic adjuvant that does not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present application enables the design of 
specific nucleotide probes and primers that can be useful in diagnosis. Accordingly, in a fifth 
aspect of the invention, there is provided a nucleotide probe or primer having a sequence found 
in or derived by degeneracy of the genetic code from a sequence shown in SEQ ID NO: 1 to 2. 

The term "probe" as used in the present application refers to DNA (preferably single 
stranded) or RNA molecules (or modifications or combinations thereof) that hybridize under the 
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stringent conditions, as defined above, to nucleic acid molecules having sequences homologous 
to those shown in SEQ ID NOs:l and 2, or to a complementary or anti-sense sequence. 
Generally, probes are significantly shorter than full-length sequences shown in SEQ ID NOs:l 
and 2; for example, they can contain from about 5 to about 100, preferably from about 10 to 
about 80 nucleotides. In particular, probes have sequences that axe at least 75%, preferably at 
least 85%, more preferably 95% homologous to a portion of a sequence as shown in SEQ ID 
NOs:l and 2 or that are complementary to such sequences. Probes can contain modified bases 
such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimemylarnino-5-deoxyuridine, or 
diamino-2, 6-purine. Sugar or phosphate residues can also be modified or substituted. For 
example, a deoxyribose residue can be replaced by a polyamide (Nielsen et aL, Science (1991) 
254:1497) and phosphate residues can be replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 2-hydroxyl group on 
ribonucleotides can be modified by including, e.g., alkyl groups. 

Probes of the invention can be used in diagnostic tests, as capture or detection probes. 
Such capture probes can be conventionally immobilized on a solid support, directly or indirectly, 
by covalent means or by passive adsorption. A detection probe can be labelled by a detection 
marker selected from radioactive isotopes; enzymes such as peroxidase, alkaline phosphatase, 
and enzymes able to hydrolyze a chromogenic, fluorogenic, or luminescent substrate; compounds 
that are chromogenic, fluorogenic, or luminescent; nucleotide base analogs; and biotin. 

Probes of the invention can be used in any conventional hybridization technique, such 
as dot blot (Maniatis et aL, Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot (Southern, J. MoL Biol. (1975) 
98:503), northern blot (identical to Southern blot to the exception that RNA is used as a target), 
or the sandwich technique (Dunn et aL, Cell (1977) 12:23). The latter technique involves the use 
of a specific capture probe and/or a specific detection probe with nucleotide sequences that at 
least partially differ from each other. 

A primer is usually a probe of about 10 to about 40 nucleotides that is used to initiate 
enzymatic polymerization of DNA in an amplification process (e.g., PCR), in an elongation 
process, or in a reverse transcription method. In a diagnostic method involving PCR, primers 
can be labelled. 
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Thus, the invention also encompasses (i) a reagent containing a probe of the invention 
for detecting and/or identifying the presence of Chlamydia in a biological material; (ii) a method 
for detecting and/or identifying the presence of Chlamydia in a biological material, in which (a) a 
sample is recovered or derived from the biological material, (b) DNA or RNA is extracted from 
5 the material and denatured, and (c) exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization conditions, such that hybridization is 
detected; and (iii) a method for detecting and/or identifying the presence of Chlamydia in a 
biological material, in which (a) a sample is recovered or derived from the biological material, 
(b) DNA is extracted therefrom, (c) the extracted DNA is primed with at least one, and preferably 
: 10 two, primers of the invention and amplified by polymerase chain reaction, and (d) the amplified 
DNA fragment is produced. 

As previously mentioned, polypeptides that can be produced upon expression of the 

Sri 

□ newly identified open reading frames are useful vaccine agents. 

y* 

Therefore, a sixth aspect of the invention features a substantially purified polypeptide 
p 15 or polypeptide derivative having an amino acid sequence encoded by a polynucleotide of the 
invention. 

s A "substantially purified polypeptide" is defined as a polypeptide that is separated 

ffj from the environment in which it naturally occurs and/or that is free of the majority of the 

£L polypeptides that are present in the environment in which it was synthesized. For example, a 

Mjj 20 substantially purified polypeptide is free from cytoplasmic polypeptides. Those skilled in the art 
will understand that the polypeptides of the invention can be purified from a natural source, L e. , a 
Chlamydia strain, or can be produced by recombinant means. 

Homologous polypeptides or polypeptide derivatives encoded by polynucleotides of 
the invention can be screened for specific antigenicity by testing cross-reactivity with an 
25 antiserum raised against the polypeptide of reference having an amino acid sequence as shown in 
SEQ ID NOs:3 to 4 . Briefly, a monospecific hyperimmune antiserum can be raised against a 
purified reference polypeptide as such or as a fusion polypeptide, for example, an expression 
product of MBP, GST, or His-tag systems or a synthetic peptide predicted to be antigenic. The 
homologous polypeptide or derivative screened for specific antigenicity can be produced as such 
30 or as a fusion polypeptide. In this latter case and if the antiserum is also raised against a fusion 
polypeptide, two different fusion systems are employed. Specific antigenicity can be determined 
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according to a number of methods, including Western blot (Towbin et al., Proc. Natl. Acad. Sci. 
USA (1979) 76:4350), dot blot, and ELISA, as described below. 

In a Western blot assay, the product to be screened, either as a purified preparation or 
a total E. coli extract, is submitted to SDS-Page electrophoresis as described by Laemmli (Nature 
(1970) 227:680). After transfer to a nitrocellulose membrane, the material is further incubated 
with the monospecific hyperimmune antiserum diluted in the range of dilutions from about 1 :5 to 
about 1:5000, preferably from about 1:100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of the dilutions in the above range. 

In an ELISA assay, the product to be screened is preferably used as the coating 
antigen. A purified preparation is preferred, although a whole cell extract can also be used. 
Briefly, about 100 \xl of a preparation at about 10 \ig protein/ml are distributed into wells of a 96- 
well polycarbonate ELISA plate. The plate is incubated for 2 hours at 37°C then overnight at 
4°C. The plate is washed with phosphate buffer saline (PBS) containing 0.05% Tween 20 
(PBS/Tween buffer). The wells are saturated with 250 |il PBS containing 1% bovine serum 
albumin (BSA) to prevent non-specific antibody binding. After 1 hour incubation at 37°C, the 
plate is washed with PBS/Tween buffer. The antiserum is serially diluted in PBS/Tween buffer 
containing 0.5% BSA. 100 |xl of dilutions are added per well. The plate is incubated for 
90 minutes at 37°C, washed and evaluated according to standard procedures. For example, a 
goat anti-rabbit peroxidase conjugate is added to the wells when specific antibodies were raised 
in rabbits. Incubation is carried out for 90 minutes at 37°C and the plate is washed. The reaction 
is developed with the appropriate substrate and the reaction is measured by colorimetry 
(absorbance measured spectrophotometrically). Under the above experimental conditions, a 
positive reaction is shown by O.D. values greater than a non immune control serum. 

In a dot blot assay, a purified product is preferred, although a whole cell extract can 
also be used. Briefly, a solution of the product at about 1 00 |ig/ml is serially two-fold diluted in 
50 mM Tris-HCl (pH 7.5). 100 |al of each dilution are applied to a nitrocellulose membrane 0.45 
|im set in a 96- well dot blot apparatus (Biorad). The buffer is removed by applying vacuum to 
the system. Wells are washed by addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air- 
dried. The membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M NaCl, 
10 g/L skim milk) and incubated with an antiserum dilution from about 1:50 to about 1:5000, 
preferably about 1 .500. The reaction is revealed according to standard procedures. For example, 
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a goat anti-rabbit peroxidase conjugate is added to the wells when rabbit antibodies are used. 
Incubation is carried out 90 minutes at 37°C and the blot is washed. The reaction is developed 
with the appropriate substrate and stopped. The reaction is measured visually by the appearance 
of a colored spot, by colorimetry. Under the above experimental conditions, a positive 
reaction is shown once a colored spot is associated with a dilution of at least about 1:5, 
preferably of at least about 1 :500. 

Therapeutic or prophylactic efficacy of a polypeptide or derivative of the invention 
can be evaluated as described below. 

According to a seventh aspect of the invention, there is provided (i) a composition of 
matter containing a polypeptide of the invention together with a diluent or carrier; in particular, 
(ii) a pharmaceutical composition containing a therapeutically or prophylactically effective 
amount of a polypeptide of the invention; (iii) a method for inducing an immune response against 
Chlamydia in a mammal, by administering to the mammal an immunogenically effective amount 
of a polypeptide of the invention to elicit an immune response, e.g., a protective immune 
response to Chlamydia', and particularly, (iv) a method for preventing and/or treating a 
Chlamydia {e.g., C. trachomatis. C. psittaci, C pneumoniae, or C pecorum) infection, by 
administering a prophylactic or therapeutic amount of a polypeptide of the invention to an 
individual in need. Additionally, the seventh aspect of the invention encompasses the use of a 
polypeptide of the invention in the preparation of a medicament for preventing and/or treating 
Chlamydia infection. 

The immunogenic compositions of the invention can be administered by any 
conventional route in use in the vaccine field, in particular to a mucosal (e.g., ocular, intranasal, 
pulmonary, oral, gastric, intestinal, rectal, vaginal, or urinary tract) surface or via the parenteral 
subcutaneous, intradermal, intramuscular, intravenous, or intraperitoneal) route. The 
choice of the administration route depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. For example, if a mucosal adjuvant is used, the intranasal or 
oral route will be preferred and if a lipid formulation or an aluminum compound is used, the 
parenteral route will be preferred. In the latter case, the sub-cutaneous or intramuscular route is 
most preferred- The choice can also depend upon the nature of the vaccine agent. For example, 
a polypeptide of the invention fused to CTB or LTB will be best administered to a mucosal 
surface. 
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A composition of the invention can contain one or several polypeptides or derivatives 
of the invention. It can also contain at least one additional Chlamydia antigen, or a subunit, 
fragment, homolog, mutant, or derivative thereof. 

For use in a composition of the invention, a polypeptide or derivative thereof can be 
5 formulated into or with liposomes, preferably neutral or anionic liposomes, microspheres, 
ISCOMS, or virus-like-particles (VLPs) to facilitate delivery and/or enhance the immune 
response. These compounds are readily available to one skilled in the art; for example, see 
Liposomes: A Practical Approach (supra). 

Adjuvants other than liposomes and the like can also be used and are known in the 
10 art. A appropriate selection can conventionally be made by those skilled in the art, for example, 
from the list provided below. 

Administration can be achieved in a single dose or repeated as necessary at intervals 
as can be determined by one skilled in the art. For example, a priming dose can be followed by 
three booster doses at weekly or monthly intervals. An appropriate dose depends on various 
15 parameters including the recipient (e.g., adult or infant), the particular vaccine antigen, the route 
and frequency of administration, the presence/absence or type of adjuvant, and the desired effect 
(e.g., protection and/or treatment), as can be determined by one skilled in the art. In general, a 
vaccine antigen of the invention can be administered by a mucosal route in an amount from about 
1 0 ng to about 500 mg, preferably from about 1 mg to about 200 mg. For the parenteral route of 
p 20 administration, the dose visually should not exceed about 1 mg, preferably about 100 \ig. 

When used as vaccine agents, polynucleotides and polypeptides of the invention can 
be used sequentially as part of a multistep immunization process. For example, a mammal can 
be initially primed with a vaccine vector of the invention such as a pox virus, e.g., via the 
parenteral route, and then boosted twice with the polypeptide encoded by the vaccine vector, e.g., 
25 via the mucosal route. In another example, liposomes associated with a polypeptide or derivative 
of the invention can also be used for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in combination with a mucosal adjuvant (e.g., 
LT). 

A polypeptide derivative of the invention is also useful as a diagnostic reagent for 
30 detecting the presence of anii-Chlamydia antibodies, e.g., in a blood sample. Such polypeptides 
are about 5 to about 80, preferably about 10 to about 50 amino acids in length and can be labeled 
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or unlabeled, depending upon the diagnostic method. Diagnostic methods involving such a 
reagent are described below. 

Upon expression of a DNA molecule of the invention, a polypeptide or polypeptide 
derivative is produced and can be purified using known laboratory techniques. For example, the 
polypeptide or polypeptide derivative can be produced as a fusion protein containing a fused tail 
that facilitates purification. The fusion product can be used to immunize a small mammal, e.g., a 
mouse or a rabbit, in order to raise antibodies against the polypeptide or polypeptide derivative 
(monospecific antibodies). The eighth aspect of the invention thus provides a monospecific 
antibody that binds to a polypeptide or polypeptide derivative of the invention. 

By ^monospecific antibody" is meant an antibody that is capable of reacting with a 
unique naturally-occurring Chlamydia polypeptide. An antibody of the invention can be 
polyclonal or monoclonal. Monospecific antibodies can be recombinant, e.g., chimeric {e.g., 
constituted by a variable region of murine origin associated with a human constant region), 
humanized (a human immunoglobulin constant backbone together with hypervariable region of 
animal, e.g., murine, origin), and/or single chain. Both polyclonal and monospecific antibodies 
can also be in the form of immunoglobulin fragments, e.g., F(ab) f 2 or Fab fragments. The 
antibodies of the invention can be of any isotype, e.g., IgG or IgA, and polyclonal antibodies can 
be of a single isotype or can contain a mixture of isotypes. 

The antibodies of the invention, which are raised to a polypeptide or polypeptide 
derivative of the invention, can be produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, e.g., Coligan et al., Current Protocols 
in Immunology (1994) John Wiley & Sons, Inc., New York, NY). The antibodies can be used in 
diagnostic methods to detect the presence of a Chlamydia antigen in a sample, such as a 
biological sample. The antibodies can also be used in affinity chromatography methods for 
purifying a polypeptide or polypeptide derivative of the invention. As is discussed further 
below, such antibodies can be used in prophylactic and therapeutic passive immunization 
methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for detecting the 
presence of Chlamydia in a biological sample that contains an antibody, polypeptide, or 
polypeptide derivative of the invention; and (ii) a diagnostic method for detecting the presence of 
Chlamydia in a biological sample, by contacting the biological sample with an antibody, a 
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polypeptide, or a polypeptide derivative of the invention, such that an immune complex is 
formed, and by detecting such complex to indicate the presence of Chlamydia in the sample or 
the organism from which the sample is derived. 

Those skilled in the art will understand that the immune complex is formed between a 
5 component of the sample and the antibody, polypeptide, or polypeptide derivative, whichever is 
used, and that any unbound material can be removed prior to detecting the complex. As can be 
easily understood, a polypeptide reagent is useful for detecting the presence of anti-Chlamydia 
antibodies in a sample, e.g. 9 a blood sample, while an antibody of the invention can be used for 
screening a sample, such as a gastric extract or biopsy, for the presence of Chlamydia 
10 polypeptides. 

For use in diagnostic applications, the reagent (i.e., the antibody, polypeptide, or 
polypeptide derivative of the invention) can be in a free state or immobilized on a solid support, 
%1 such as a tube, a bead, or any other conventional support used in the field. Immobilization can 
f-f be achieved using direct or indirect means. Direct means include passive adsorption (non- 
El 15 covalent binding) or covalent binding between the support and the reagent. By "indirect means" 
y is meant that an anti-reagent compound that interacts with a reagent is first attached to the solid 

^ support For example, if a polypeptide reagent is used, an antibody that binds to it can serve as 
Jj7 an anti-reagent, provided that it binds to an epitope that is not involved in the recognition of 

1*=?. 

p antibodies in biological samples. Indirect means can also employ a ligand-receptor system, for 

^ 20 example, a molecule such as a vitamin can be grafted onto the polypeptide reagent and the 
corresponding receptor can be immobilized on the solid phase. This is illustrated by the biotin- 
streptavidin system. Alternatively, indirect means can be used, e.g., by adding to the reagent a 
peptide tail, chemically or by genetic engineering, and immobilizing the grafted or fused product 
by passive adsorption or covalent linkage of the peptide tail. 
25 According to a tenth aspect of the invention, there is provided a process for purifying, 

from a biological sample, a polypeptide or polypeptide derivative of the invention, which 
involves carrying out antibody-based affinity chromatography with the biological sample, 
wherein the antibody is a monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be polyclonal or 
30 monospecific, and preferably is of the IgG type. Purified IgGs can be prepared from an 
antiserum using standard methods (see, e.g., Coligan et ah, supra). Conventional 
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chromatography supports, as well as standard methods for grafting antibodies, are disclosed in, 
e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988). 

Briefly, a biological sample, such as an C. pneumoniae extract, preferably in a buffer 
solution, is applied to a chromatography material, preferably equilibrated with the buffer used to 
dilute the biological sample so that the polypeptide or polypeptide derivative of the invention 
(i.e., the antigen) is allowed to adsorb onto the material. The chromatography material, such as a 
gel or a resin coupled to an antibody of the invention, can be in batch form or in a column. The 
unbound components are washed off and the antigen is then eluted with an appropriate elution 
buffer, such as a glycine buffer or a buffer containing a chaotropic agent, e.g., guanidine HC1, or 
high salt concentration (e.g., 3 M MgClj). Eluted fractions are recovered and the presence of the 
antigen is detected, e.g., by measuring the absorbance at 280 run. 

An antibody of the invention can be screened for therapeutic efficacy as described as 
follows. According to an eleventh aspect of the invention, there is provided (i) a composition of 
matter containing a monospecific antibody of the invention, together with a diluent or carrier; (ii) 
a pharmaceutical composition containing a therapeutically or prophylactically effective amount 
of a monospecific antibody of the invention, and (iii) a method for treating or preventing a 
Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae or C. pecontm) infection, by 
administering a therapeutic or prophylactic amount of a monospecific antibody of the invention 
to an individual in need. Additionally, the eleventh aspect of the invention encompasses the use 
of a monospecific antibody of the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

To this end, the monospecific antibody can be polyclonal or monoclonal, preferably 
of the IgA isotype (predominantly). In passive immunization, the antibody can be administered 
to a mucosal surface of a mammal, e.g., the gastric mucosa, e.g., orally or intragastrically, 
advantageously, in the presence of a bicarbonate buffer. Alternatively, systemic administration, 
not requiring a bicarbonate buffer, can be carried out A monospecific antibody of the invention 
can be administered as a single active component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The amount of antibody and the 
particular regimen used can be readily determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one week, or three doses per day of 
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m 



about 100 to 1,000 mg of antibodies over two or three days, can be an effective regimens for 
most purposes. 

Therapeutic or prophylactic efficacy can be evaluated using standard methods in the 
art, e.g., by measuring induction of a mucosal immune response or induction of protective and/or 
5 therapeutic immunity, using, e.g., the C. pneumoniae mouse model . Those skilled in the art will 
recognize that the C pneumoniae strain of the model can be replaced with another Chlamydia 
strain. For example, the efficacy of DNA molecules and polypeptides from C pneumoniae is 
preferably evaluated in a mouse model using an C pneumoniae strain. Protection can be 
determined by comparing the degree of Chlamydia infection to that of a control group. 
10 Protection is shown when infection is reduced by comparison to the control group. Such an 
evaluation can be made for polynucleotides, vaccine vectors, polypeptides and derivatives 
thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions described above are as follows. 
j~ Adjuvants for parenteral administration include aluminum compounds, such as 

Cj 15 aluminum hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The antigen can 
UJ be precipitated with, or adsorbed onto, the aluminum compound according to standard protocols. 

~~ Other adjuvants, such as RIBI (ImmunoChem, Hamilton, MT), can be used in parenteral 

J^J administration. 

£J Adjuvants for mucosal administration include bacterial toxins, e.g., the cholera toxin 

20 (CT), the E. coli heat-labile toxin (LT), the Clostridium difficile toxin A and the pertussis toxin 
(PT), or combinations, subunits, toxoids, or mutants thereof. For example, a purified preparation 
of native cholera toxin subunit B (CTB) can be of use. Fragments, homologs, derivatives, and 
fusions to any of these toxins are also suitable, provided that they retain adjuvant activity. 
Preferably, a mutant having reduced toxicity is used. Suitable mutants are described, e.g., in WO 
25 95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO 95/34323 
(Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants that can be used in the 
methods and compositions of the invention include, e.g., Ser-63-Lys, Ala-69-GIy, Glu-110-Asp, 
and Glu-112-Asp mutants. Other adjuvants, such as a bacterial monophosphoryl lipid A 
(MPLA) of, e.g., E. coli, Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri; 
30 saponins, or polylactide glycolide (PLGA) microspheres, can also be used in mucosal 
administration. 
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Adjuvants useful for both mucosal and parenteral administrations include 
polyphosphazene (WO 95/2415), DC-chol (3 b-(N-(N^N'-&methyl aminomethane)-carbamoyl) 
cholesterol; U.S. Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention, containing a polynucleotide, a 
5 polypeptide, a polypeptide derivative, or an antibody of the invention, can be manufactured in a 
conventional manner. In particular, it can be formulated with a pharmaceutically acceptable 
diluent or carrier, e.g., water or a saline solution such as phosphate buffer saline. In general, a 
diluent or carrier can be selected on the basis of the mode and route of administration, and 
standard pharmaceutical practice. Suitable [ pharmaceutical carriers or diluents, as well as 
10 pharmaceutical necessities for their use in pharmaceutical formulations, are described in 
Remington's Pharmaceutical Sciences, a standard reference text in this field and in the USP/NF. 

The invention also includes methods in which Chlamydia infection, are treated by 
oral administration of a Chlamydia polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a combination thereof. Examples of 
15 such compounds that can be administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and derivatives thereof (specific examples 
of antibiotics that can be used include azithromycin or doxicyclin or immunomodulators such as 
cytokines or steroids. In addition, compounds containing more than one of the above-listed 
components coupled together, can be used. The invention also includes compositions for 
jj 20 carrying out these methods, i.e., compositions containing a Chlamydia antigen (or antigens) of 
the invention, an adjuvant, and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods and compositions of the 
invention can readily be determined by one skilled in the art. In addition, one skilled in the art 
25 can readily design treatment/immunization schedules. For example, the non-vaccine components 
can be administered on days 1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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250 255 



265 270 



ggg 



285 290 



300 305 



315 320 



330 335 



345 350 



365 370 



380 385 



Ala 
195 


Thr 


Gly 




att 
He 
He 


aca 
Thr 
Thr 


aat 
Asn 
Asn 


739 


get 
Ala 
Ala 


gca 
Ala 
Ala 


ggt 
Glv 
Gly 


787 


aat 
Asn 
Asn 


acg 
Thr 
Thr 


tct 
Ser 
Ser 
245 


835 


aat 
Asn 
Asn 


gga 
Glv 
Gly 
260 


gga 
Glv 
Gly 


883 


cag 
Gin 
Gin 
275 


agt 
Ser 
Ser 


gta 
Val 
Val 


931 


att 
He 
He 


tat 
Tyr 
Tyr 


get 
Ala 
Ala 


97 9 


ccc 
Pro 
Pro 


ttt 
Phe 
Phe 


tct 
Ser 
Ser 


1027 


gec 
Ala 
Ala 


att 
He 
He 


tct 
Ser 
Ser 
325 


1075 


ggg 

Glv 
Gly 


gac 
Asp 
Asp 
34 0 


cat 
His 
His 


1123 


act. 
Thr 
Thr 
355 


aca 
Thr 
Thr 


aaa 
Lys 
Lys 


1171 


gaa 
Glu 
Glu 


tta 
Leu 
Leu 


cgt 
Arg 
Arg 


1219 


act 
Thr 
Thr 


get 
Ala 
Ala 


aat 
Asn 
Asn 


1267 


get 


gat 


gca 


1315 



33 






Thr 
Thr 
390 


Ala 
Ala 


Ala 
Ala 


Asp 
Asp 


Ser 
Ser 


Thr 
Thr 
395 


Asp 
Asp 


Thr 
Thr 


Leu 
Leu 




ggt 
Gly 
Gly 


aat 
Asn 
Asn 


agt 
Ser 
Ser 


aca 
Thr 
Thr 


gat 
Asp 
Asp 
410 


tat 
Tyr 
Tyr 


agt 
Ser 
Ser 


ggg 

Gly 
Gly 


teg 
Ser 
Ser 




etc 
Leu 
Leu 


tct 
Ser 
Ser 


gaa 
Glu 
Glu 


gat 
Asp 
Asp 
425 


gaa 
Glu 
Glu 


gca 
Ala 
Ala 


aaa 
Lys 
Lys 


gtt 

Val 
val 


gca 
Ala 
Ala 
430 




aag 
Lys 
Lys 


cag 
Gin 
Gin 


cct 
Pro 
Pro 
440 


gta 
Val 
Val 


act 
Thr 
Thr 


eta 
Leu 
Leu 


act 
Thr 
Thr 


gca 
Ala 
Ala 
445 


gga 
Gly 
Gly 




gtc 
Val 
Val 


act 
Thr 
Thr 
455 


etc 
Leu 
Leu 


gat 
Aso 
Asp 


acg 
Thr 
Thr 


aaa 
Lvs 
Lys 


ggc 
Gly 
Gly 
4 60 


ttt 
Phe 
Phe 


act 
Thr 
Thr 


y i 

f == % 


att 
He 
He 
470 


atg 
Met 
Met 


gat 
Asp 
Asp 


gcg 
Ala 
Ala 


ggc 
Glv 
Gly 


aca 
Thr 
Thr 
475 


acg 
Thr 
Thr 


tta 
Leu 
Leu 


aaa 
Lys 
Lys 


Q 

liJ 


tta 
Leu 


aca 
Thr 
Thr 


ggt 
Gly 
Gly 


ctt 
Leu 


tec 
Ser 
Ser 
490 


att 
He 
He 


cct 
Pro 
Pro 


gta 
Val 
Val 


gac 
Asp 
Asp 


m 


gtt 
Val 
Val 


gta 
Val 
Val 


att 
He 
He 


get 
Ala 
Ala 
505 


get 
Ala 
Ala 


tct 
Ser 
Ser 


gca 
Ala 
Ala 


gca 
Ala 
Ala 


agt 
Ser 
Ser 
510 




ccg 
Pro 
Pro 


att 
He 
He 


ctt 
Leu 
Leu 
520 


ctt 
Leu 
Leu 


ttg 
Leu 
Leu 


gat 
Asp 
Asp 


aac 
Asn 
Asn 


caa 
Gin 
Gin 
525 


ggg 

Glv 
Gly 




tta 
Leu 
Leu 


gga 
Glv 
Gly 
535 


aaa 
Lys 
Lys 


act 
Thr 
Thr 


caa 
Gin 
Gin 


gac 
Asp 
Asp 


ttt 
Phe 
Phe 
540 


tea 
Ser 
Ser 


ttt 
Phe 
Phe 




act 
Thr 
Thr 
550 


gca 
Ala 
Ala 


aca 
Thr 
Thr 


act 
Thr 
Thr 


aca 
Thr 
Thr 


gat 
Asd 
Asp 
555 


gtt 
Val 
Val 


cca 
Pro 
Pro 


gcg 
Ala 
Ala 




acg 
Thr 
Thr 


cac 
His 
His 


tat 
Tyr 
Tyr 


ggg 

Gly 
Gly 


tat 
Tyr 
Tyr 
570 


caa 
Gin 
Gin 


ggt 
Gly 
Gly 


act 
Thr 
Thr 


tgg 
Trp 
Trp 




acc 
Thr 
Thr 


gca 
Ala 
Ala 


age 
Ser 
Ser 


act 
Thr 
Thr 
585 


cca 
Pro 
Pro 


aag 
Lys 
Lys 


act 
Thr 
Thr 


aag 
Lys 
Lys 


aca 
Thr 
Thr 
590 
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Asn 
Asn 


Leu 
Leu 
400 


Asn 
Asn 


Lys 
Lys 


Ala 
Ala 


Asp 
Asp 


Ala 
Ala 
405 




att 
He 
He 
415 


gtt 
Val 
Val 


ttt 
Phe 
Phe 


tct 
Ser 
Ser 


ggt 
Gly 
Gly 


gaa 
Glu 
Glu 
420 


aag 
Lys 
Lys 


1363 


gac 
Asp 
Asp 


aac 
Asn 
Asn 


etc 
Leu 
Leu 


act 
Thr 
Thr 


tct 
Ser 
Ser 
4 35 


acg 
Thr 
Thr 


ctg 
Leu 
Leu 


1411 


aat 
Asn 
Asn 


tta 
Leu 
Leu 


gta 
Val 
Val 


ctt 
Leu 
Leu 
450 


aaa 
Lvs 

" Jr 

Lys 


cgt 
Ara 
Arg 


ggt 
Glv 
Gly 


1459 


cag 
Gin 
Gin 


acc 
Thr 
Thr 


gcg 
Ala 
Ala 
4 65 


ggt 
Glv 
Gly 


tec 
Ser 
Ser 


tct 
Ser 
Ser 


gtt 
Val 
Val 


1507 


gca 
Ala 
Ala 

! 


agt 
Ser 
Ser 
480 


aca 
Thr 
Thr 


gag 
Glu 
Glu 


gag 
Glu 
Glu 


gtc 
Val 
Val 


act 
Thr 
Thr 
485 


1555 


tct 
Ser 
Ser 
4 95 


tta 
Leu 


ggc 
Gly 
Gly 


gag 
Glu 
Glu 


ggt 
Gly 
Gly 


aag 

Lys 
500 


aaa 
Lys 


1603 


aaa 
Lys 


aat 
Asn 
Asn 


gta 
Val 
Val 


gee 
Ala 
Ala 


ctt 
Leu 
Leu 
515 


agt 
Ser 
Ser 


ggt 
Gly 
Gly 


1651 


aat 
Asn 


get 
Ala 
Ala 


tat 
iyx 
Tyr 


gaa 
Glu 
Glu 
530 


aat 
Asn 
Asn 


cac 
His 
His 


gac 
Asp 
Asp 


1699 


gtg 
Val 
Val 


cag 
Gin 
Gin 


etc 
Leu 
Leu 
545 


tct 
Ser 
Ser 


get 
Ala 
Ala 


ctg 
Leu 
Leu 


ggt 
Glv 

VJX Jr 

Gly 


1747 


gtt 
Val 
Val 


cct 
Pro 
Pro 
560 


aca 
Thr 
Thr 


gta 
Val 
Val 


gca 
Ala 
Ala 


act 
Thr 
Thr 


cct 
Pro 
Pro 
565 


1795 


gga 
Gly 
Gly 
575 


atg 
Met 
Met 


act 
Thr 
Thr 


tgg 
Trp 
Trp 


gtt 

Val 
Val 


gat 
Asp 
Asp 
580 


gat 
Asp 
Asp 


1843 


gcg 
Ala 
Ala 


aca 
Thr 
Thr 


tta 
Leu 
Leu 


get 
Ala 
Ala 


tgg 
Trp 
Trp 


acc 
Thr 
Thr 


aat 
Asn 
Asn 


1891 



595 



34 



ft 



m 



ccg 


aat 


cct 


Pro 


Asn 


riO 


Pro 


Asn 


Pro 


tct 


ttt 


tea 


Ser 


Fne 




Ser 


Phe 


Ser 






620 


ttg 


act 


ctt 


Leu 


Thr 


Leu 


Leu 


Thr 


Leu 




635 




ttc 


tta 


gat 


Phe 


Leu 


Asp 


Phe 


Leu 


Asp 



600 605 610 



615 620 625 



630 635 640 645 



650 655 660 

tac cgt cat aaa tct ggt gga tat get ate gga ggt gca gcg caa act 

Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly Gly Ala Ala Gin Thr 

Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly Gly Ala Ala Gin Thr 
665 670 675 

tgt tct gaa aac tta att age ttt gee ttt tgc caa etc ttt ggt age 
Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 
Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 
680 685 690 



aaa 


gat 


ttc 


tta 


gtc 


get 


aaa 


aat 


cat 


act 


gat 


acc 


tat 


gca 


gga 


Lys 


Asp 


Phe 


Leu 


Val 


Ala 


Lys 


Asn 


His 


Thr 


Asp 


Thr 


Tyr 


Ala 


Gly 


Lys 


Asp 


Phe 


Leu 


Val 


Ala 


Lys 


Asn 


His 


Thr 


Asp 


Thr 


Tyr 


Ala 


Gly 


695 








700 










705 










ttc 


tat 


ate 


caa 


cac 


att 


aca 


gaa 


tgt 


agt 


ggg 


ttc 


ata 


ggt 


tgt 


Phe 


Tyr 


lie 


Gin 


His 


He 


Thr 


Glu 


Cys 


Ser 


Gly 


Phe 


He 


Gly Cys 


Phe 


Tyr 


lie 


Gin 


His 


He 


Thr 


Glu 


Cys 


Ser 


Gly 


Phe 


He 


Gly 


Cys 



710 715 720 725 



735 740 



745 750 755 

tat act gcg tat cct gag gtg aaa ggt tct tgg ggg aat aat get ttt 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn Ala Phe 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn Ala Phe 
760 765 770 



etc 


tta 


gat 


aaa 


ctt 


cct 


ggc 


Leu 


Leu 


Asp 


Lys 


Leu 


Pro 


Gly 


Leu 


Leu 


Asp 


Lys 


Leu 


Pro 


Gly 










730 






gaa 


ggg 


cag 


etc 


get 


tat 


age 


Glu 


Gly 


Gin 


Leu 


Ala 


Tyr 


Ser 


Glu 


Gly 


Gin 


Leu 


Ala 


Tyr 


Ser 



aac 


atg 


atg 


ttg 


gga 


get 


tct 


tct 


cat 


tct 


tat 


cct 


gaa 


tac 


ctg 


Asn 


Met 


Met 


Leu 


Gly 


Ala 


Ser 


Ser 


His 


Ser 


Tyr 


Pro 


Glu 


Tyr 


Leu 


Asn 


Met 


Met 


Leu 


Gly 


Ala 


Ser 


Ser 


His 


Ser 


Tyr 


Pro 


Glu 


Tyr 


Leu 




775 








780 










785 








tgt 


ttt 


gat 


acc 


tat 


get 


cca 


tac 


ate 


aaa 


ctg 


aat 


ctg 


acc 


tat 


Cys 


Phe 


Asp 


Thr 


Tyr 


Ala 


Pro 


Tyr 


He 


Lys 


Leu 


Asn 


Leu 


Thr 


Tyr 


Cys 


Phe 


Asp 


Thr 


Tyr 


Ala 


Pro 


Tyr 


He 


Lys 


Leu 


Asn 


Leu 


Thr 


Tyr 


790 








795 










800 
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1939 

1987 
2035 
2083 



2131 



2179 



2227 



2275 



2323 



2371 



2419 



2467 



2515 



805 
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cgt 


cag 


gac 


age 


ttc 


teg 


gag 


aaa 


ggt 


aca 


gaa 


gga 


aga 


tct 


ttt 


gat 






Axg 


Gin 


Asp 


Ser 


Phe 


Ser 


Glu 


Lys 


Gly 


Thr 


Glu 


Gly 


Arg 


Ser 


Phe 


Asp 






Arg 


Gin 


Asp 


Ser 


Phe 


Ser 


Glu 


Lys 


Gly 


Thr 


Glu 


Gly 


Arg 


Ser 


Phe Asp 










810 










815 










820 








gac 


age 


aac 


etc 


ttc 


aat 


tta 


tct 


ttg 


cct 


ata 


999 


gtg 


aag 


ttt 


gag 


Z D XX 




Asp 


Ser 


Asn 


Leu 


Phe 


Asn 


Leu 


Ser 


Leu 


Pro 


He 


Gly 


Val 


Lys 


Phe 


Glu 






Asp 


Ser 


Asn 


Leu 


Phe 


Asn 


Leu 


Ser 


Leu 


Pro 


He 


Gly 


Val 


Lys 


Phe 


Glu 










825 










830 










835 










aag 


ttc 


tct 


gat 


tgt 


aat 


gac 


ttt 


tct 


tat 


gat 


ctg 


act 


tta 


tec 


tat 






Lys 


Phe 


Ser 


Asp 


Cys 


Asn 


Asp 


Phe 


Ser 


Tyr 


Asp 


Leu 


Thr 


Leu 


Ser 


Tyr 






Lvs 


Phe 


Ser 


Asp 


Cys 


Asn 


Asp 


Phe 


Ser 


Tyr 


Asp 


Leu 


Thr 


Leu 


Ser 


Tyr 








840 










845 










850 












gtt 


cct 


gat 


ctt 


ate 


cgc 


aat 


gat 


ccc 


aaa 


tgc 


act 


aca 


gca 


ctt 


ata 


2707 




Val 


Pro 


Asp 


Leu 


lie 


Arg 


Asn 


Asp 


Pro 


Lys 


Cys 


Thr 


Thr 


Ala 


Leu 


Val 






Val 


Pro 


Asp 


Leu 


lie 


Arg 


Asn 


Asp 


Pro 


Lys 


Cys 


Thr 


Thr 


Ala 


Leu 


Val 








855 








860 










865 














ate 


age 


gga 


gee 


tct 


tgg 


gaa 


act 


tat 


gee 


aat 


aac 


tta 


gca 


ega 


cag 


2755 


A 


lie 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 






lie 


Ser 


Glv 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn 


Leu 


Ala 


Arg 


Gin 




CP 


870 








875 










880 










885 




Is™ 


gec 


ttg 


caa 


gtg 


cgt 


gca 


ggc 


agt 


cac 


tac 


gec 


ttc 


tct 


cct 


atg 


ttt 


2803 




Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe 


Ser 


Pro 


Met 


Phe 






Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe 


Ser 


Pro 


Met 


Phe 




IsscS 

■4? 










890 










8 95 










900 






Id 


gaa 


gtg 


etc 


ggc 


cag 


ttt 


gtc 


ttt 


gaa 


gtt 


cgt 


gga 


tec 


tea 


egg 


att 


2851 


08 


Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly 


Ser 


Ser 


Arg 


He 




a 


Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly 


Ser 


Ser 


Arg 


He 












905 










910 










915 








& 


tat 


aat 


gta 


gat 


ctt 


ggg 


ggt 


aag 


ttc 


caa 


ttc 


taggagegtc 


tctcatgtct 


2904 


Tfrr? 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 














Ste=V 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 














Ml 




920 










925 





















CO 

cagaaattct gagagagatc gcatttagga ttttcttaaa cacgac 2950 
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Figure 2 

Restriction enzyme analysis of CPN100635 



Msel 
I 



Apol 
Tsp509I 
Hpyl78III 
Alul 
CviJI 
MspAlI 
PvuII 
I 



Ddel 
Hpyl88IX 
Hinfl 
Hpyl78III I 
BseMIIl I 
Plel I I I 
i III 



AGTTAAAGATGACAAAACAGCTGTCAAGAATTTTTATCTTGACTCTCTGAGTTTTCTATT 



-+ 60 



TCAATTTCTACTGTTTTGTCGACAGTTCTTAAAAATAGAACTGAGAGACTCAAAAGATAA 



ScrFI 
EcoRII | 



61 



Apol 
Tsp509I 

Hqal | Msel Tsp509I I I 

til III 
T TAT AT GAC GCAAG T AAG AAT T T AAT AAT AAAG T GGGT T TATGAAAT CG CAAT T T T CC T G 

+ + + + + + 120 

AAT AT AC T G C GT T C AT T C T T AAAT T ATT AT T TC ACCCAAAT AC T T T AG CGT T AAAAGGAC 



Mwol 
CviRI 

BsiHKAI Fnu4HI I 

Bspl286I Earl Nlalll Bfal TspRI I I 

MboII I TaqI | MslI Nspl Spell Bbvl Taal | Tsel | | 

| [ I I I I II I I I II I 

GTTAGTGCTCTCTTCGACATTGGCATGTTTTACTAGTTGTTCCACTGTTTTTGCTGCAAC 



121 



+ 180 



CAATCACGAGAGAAGCTGTAACCGTACAAAATGATCAACAAGGTGACAAAAACGACGTTG 

Hpyl88IX 
NlalV 
CviJI | 
Haellll 
EcoOl09l| I 
Sau96l| I 
I I 



Mnll 
Alul I 
CviJI | 



Rsal 
Seal 
TatI I 



181 



NlalV 
BanI I BsaXI 
I III 
TGC TGAAAATAT AGGCCCCTC T GAT AGCTT T G ACGGAAGTACT AACACAGGCACCTATAC 

+ + + + + + 240 

ACGACTTTTATATCCGGGGAGACTATCGAAACTGCCTTCATGATTGTGTCCGTGGATATG 



Cjel 

C i eI BsrI Hpyl88IX CviRI 

I I I 1 

T CC T AAAAAT ACGAC TACT G GAAT AG AC TAT ACT C T GAC AGG AG AT AT AACTC T GCAAAA 



241 



AG GAT 



TTTTATGCTGATGACCTTATCTGATATGAGACTGTCCTCTATATTGAGACGTTTT 



BsaJI 
Styl 

301 



Msel 
Alul 
CviJI 
Hinfl Fnu4HI | 
Tfil Tsel I I 
I I I 



Alul 
CviJI 
Hinfl Hindlll I 
Bbvl Hpyl88IX Tfil Msel I t 

I I t III 

CCTTGGGGATTCGGCAGCTTTAACGAAGGGTTGTTTTTCTGACACTACGGAATCTTTAAG 

GGAACCCCTAAGCCGTCGAAATTGCTTCCCAACAAAAAGACTGTGATGCCTTAGAAATTC 



360 



Fnu4HI 
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Hhal 1 

Dral Msel Tsel I 

BslI Rsal MsellSspI | Bfal Mwol I I 

M I I I I I 1 I IN 

CTTTGCCGGTAAGGGGTACTCACTTTCTTTTTTAAATATTAAGTCTAGTGCTGAAGGCGC 



Mspl 
BsrFI | 



361 



GAAACGGCCATTCCCCATGAGTGAAAGAAAAAATTTATAATTCAGATCACGACTTCCGCG 



-+ 420 



CviJI 



Eco57I 

Bbvl 1 Hinf I 

MaelUM BsaBI TaqI | Plel 

I III I lit 

AGCCCTTTCTGTTACAACTGATAAAAATCTGTCGCTAACAGGATTTTCGAGTCTTACTTT 

421 + + + + + + 

TCGGGAAAGACAATGTTGACTATTTTTAGACAGCGATTGTCCTAAAAGCTCAGAATGAAA 



480 



Bed 
NlalV 
CviJI 
Haelll 
Sau96I 
Fnu4HI I 
Taul | 
Acill I 
Cjel Ml 
Ddell | | I 
II 111 



481 



Hpyl78III 
Cjel | 
Bsu36I | I 
BslI | Ddel I I 
I I II 

CTTAGCGGCCCCATCATCGGTAATCACAACCCCCTCAGGAAAAGGTGCAGTTAAATGTGG 

+ + + + + + 

GAATCGCCGGGGTAGTAGCCATTAGTGTTGGGGGAGTCCTTTTCCACGTCAATTTACACC 



Mnll 
Msel 
BseMII I 
CviRI | | 
Mnll I 1 I 
I III 



540 



Bsgl 
Dpnl 
BstYI I 
Sau3AI | 



541 



Dral Taal 
Alwl Msel | Mnll I 

t I I I I 

AG GG GAT C T T ACAT T TG AT AAC AAT GG AAC TAT T T TAT T T AAACAAGAT TACT G T GAG GA 

+ + + + + + 

TCCCCTAGAATGTAAACTATTGTTACCTTGATAAAATAAATTTGTTCTAATGACACTCCT 



600 



601 



Clal 

CviJI TaqI 
NlalVI Dpnl I 

Ecil I I Hinfl Sau3AI I I 

Ac il| || Tfil Sthl32I BscGI I II Alwl 

II II I I It II I 

AAATGGCGGAGCCATTTCTACCAAGAATCTTTCTTTGAAAAACAGCACGGGATCGATTTC 

+ + + + + + 660 

TTTACCGCCTCGGTAAAGATGGTTCTTAGAAAGAAACTTTTTGTCGTGCCCTAGCTAAAG 



Hhal 

CjePIl CjePI 
TaqI I 1 CviJI Mwol I 

I II III 

TTTTGAAGGGAATAAATCGAGCGCAACAGGGAAAAAAGGTGGGGCTATTTGTGCTACTGG 

661 + + + + + + 720 

AAAACTTCCCTTATTTAGCTCGCGTTGTCCCTTTTTTCCACCCCGATAAACACGATGACC 



Taal 
Sfcl I 



MboII 



BspMI 
Sspl 
Bbvl | 
Mnll I I 
Earl | | | 
Hpyl78III | I I 



Fnu4HI 
Sfcl 
Alul | 
CviJI I 
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Bsr j II NlalVI TaqI III I Mwol I 

Rs al || CviJI I I Beef I I I I I I Tsel | 

III III I I 1 I I I II 

TACTGTAGATATTACAAATAATACGGCTCCTACCCTCTTCTCGAACAATATTGCTGAAGC 

, . + + + + 780 



721 



ATGACATCTATAATGTTTATTATGCCGAGGATGGGAGAAGAGCTTGTTATAACGACTTCG 



Eco57I 
Alul 
CviJI 
Mwol 
Aarl I 
Pstll I 
CviRI | I I 
1 I 1 



781 



Tsp509I 
Rsal 
BsrGI ! 

Taal I I BsmAI 
TatI | | Maell BsmBI 

III I I 

TGCAGGTGGAGCTATAAATAGCACAGGAAACTGTACAATTACAGGGAATACGTCTCTTGT ^ 

ACGTCCACCTCGATATTTATCGTGTCCTTTGACATGTTAATGTCCCTTATGCAGAGAACA 

BseRI 
Hpyl78III 
Banll 
BsiHKAI 
Bspl286I 
SacI 

* SfaNI 
Acil Alul I 

BsiEI Mnll CviJI I I ICjel 

II II 



Hpyl88IX 



Maelll 
Tsp45I 



ATTTTC 



TGAAAATAGTGTGACAGCGACCGCAGGAAATGGAGGAGCTCTTTCTGGAGATGC 



841 



rAAAAGACTTTTATCACACTGTCGCTGGCGTCCTTTACCTCCTCGAGAAAGACCTCTACG 



900 



Maelll 
Hpyl88IX 
Hinfl I 
BslI I I 
Bpml (Tfil I 
I I I I 



BseMII 
Sfcl 
Alul I Alul 
CviJI | CviJI 
I I I 



901 



Hpyl78III 

Maelll Bpml (Tfil I ICjel Ddel I 

i | I I I II » I 

CGATGTTACCATATCTGGGAATCAGAGTGTAACTTTCTCAGGAAACCAAGCTGTAGCTAA 

. ^ »-— — —————— — + I — — — ■+-— ~ — — *" 960 

GCTACAATGGTATAGACCCTTAGTCTCACATTGAAAGAGTCCTTTGGTTCGACATCGATT 

Mcil 
ScrFI 
BsaJI 
Mspl 
BsrI 
TspRI 
CviJI 



CviJI 
NlalVI 
Ecil I I 
Acil I I I Mwol 
I I 



Sthl32I 
Alul 
CviJI 
Hindi I I I 
Ddel I I 

I I I 

I I 



BslI 
I 



Hinfl 
Tfil 



TGGCGGAGCCATTTATGCTAAGAAGCTTACACTGGCTTCCGGGGGGGGGGGGGGGAATCC 

961 + + + + + + 1020 

ACCGCCTCGGTAAATACGATTCTTCGAATGTGACCGAAGGCCCCCCCCCCCCCCCTTAGG 

PstI 
TspRI 
CviRI | 
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HgiEII 
BtsI 
Sfcl 
Kpnl 
NlalV 
Rsal 
BanI 



BspMI 
BsaJI I 
Styl I 
Cjell I 
I I 



CviJI Xcml 
NlalVl Cjel 1 
II II 



1021 



CTTTTCTAACAATATAGTCCAAGGTACCACTGCAGGTAATGGTGGAGCCATTTCTATACT 

+ + + -+ + + 1080 

GAAAAGATTGTTATATCAGGTTCCATGGTGACGTCCATTACCACCTCGGTAAAGATATGA 



1081 



Alul 
CviJI 
MspAlI 
PvuII 

BsrI | NlalV BsmFI 

Fnu4HI I Avail | Msel | 

T se l I | Bbvl Bpml Sau96I| Xcml t I 

Ml I I M M I 

GGCAGCTGGAGAGTGTAGTCTTTTCAGCGAAGCAGGGGACCATTACCTTAATGGGAATGC 

. + + + + + 1140 



CCGTCGACCTCTCACATCAGAAAAGTCGCTTCGTCCCCTGGTAATGGAATTACCCTTACG 



CviRI 
Mwol I 
BsmI | I 
I I 



Apol 
Tsp509I 



Dpnl 
BstYI 
Sau3AI 
HaelV I 
Hin4I | 
I I 



Bcgl 
Alwl | 
I I 



1141 



CATTGTTGCAACTACACCACAAACTACAAAAAGAAATTCTATTGACATAGGATCTACTGG 

+ + + + + + 1200 

GTAACAACGTTGATGTGGTGTTTGATGTTTTTCTTTAAGATAACTGTATCCTAGATGACC 



Tsp509I 
Hpyl78III | 

Dpnl I I CviRI 
Sau3AI | I I BsaAI I 
BsrI I I I IMaelll I 
III M 



Bcgl 



Hpyl88IX 
Dpnl 
Sau3AI 
Pf 111081 | 
SfaNI I I 
Alwl | | | 
I 



Bbvl 



1201 



CAAAGATCACGAATTACGTGCAATATCTGGGCATAGCATCTTTTTCTACGATCCGATTAC 

_ . — — — i _-——————+——— — — — — — — + 1260 

GTTTCTAGTGCTTAATGCACGTTATAGACCCGTATCGTAGAAAAAGATGCTAGGCTAATG 



Hinf I 
Tfil 
Acil 
Fnu4HI 
CviJI | 
Tsel | 
Mwol | | 
I I 



1261 



CviJI 

Bcefl Dral BspMI I 

I Sfcl | Msel | SfaNI I I CviRI 

II II III I 

TGCTAATACGGCTGCGGATTCTACAGATACTTTAAATCTCAATAAGGCTGATGCAGGTAA ^ 



GATTATGCCGACGCCTAAGATGTCTATGAAATTTAGAGTTATTCCGACTACGTCCATT 



AC' 



Rsal 



TaqI 



Hpyl88IX 
HphI | 
Alul I I 
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TatI | SimI | CviJI I I 

II II I "I 

TAGTACAGATTATAGTGGGTCGATTGTTTTTTCTGGTGAAAAGCTCTCTGAAGATGAAGC 

1321 + + + + + + 1380 

ATCATGTCTAATATCACCCAGCTAACAAAAAAGACCACTTTTCGAGAGACTTCTACTTCG 



Eco57I Maelll 

CviRI I CviJI 

BstAPI I I Fnu4HI I 

Mwol I I Mv/ol I I 

MboII I I I Mnll Tsel I I 

1111 I Ml 

AAAAGTTGCAGACAACCTCACTTCTACGCTGAAGCAGCCTGTAACTCTAACTGCAGGAAA 

1381 + + + + + + 1440 

TTTTCAACGTCTGTTGGAGTGAAGATGCGACTTCGTCGGACATTGAGATTGACGTCCTTT 



Apol 
Tsp509I 
PstI 
CviRI 
Sfcl | 
Eco57I| | 
Bbvl I | 1 
I III 



BseMII 
NlalV 

SacII 

Acil 



Maell 
Msel I 
Rsal 1 1 
Seal I 
TatI I I 
I I I 



1 Hpyl78III 
IMaelll I 
|Tsp45I TaqI 
I I I 



MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 
Hpyl88IX 
Faul | 
Ddel | | 
Sthl32I| | 
CviJI I I I 

I III 



TTTAGTACTTAAACGTGGTGTCACTCTCGATACGAAAGGCTTTACTCAGACCGCGGGTTC 



1441 



-+ 1500 



AAATCATGAATTTGCACCACAGTGAGAGCTATGCTTTCCGAAATGAGTCTGGCGCCCAAG 



Faul 
Mnll 
Sthl32I | 
SfaNI I I 
I I 



Msel 
Acil 
Fokl 
Mae 1 1 
Bspl286l 
Bmgl 
BseSI 
Cac8I I 
Acil | I 
I I I 



Mnll 
Rsal 
TatI | 
Mnll I I 
I I I 



Maelll 
Tsp45I 



BseRI 
Msel I 
I I 



C TC T G T TAT T AT GG AT GCG G GC AC AAC G T T AAAAGC AAG TACAG AG GAGGTC AC T T T AAC 



1501 



+ 1560 



GAGACAATAATACCTACGCCCGTGTTGCAATTTTCGTTCATGTCTCCTCCAGTGAAATTG 



1561 



Hinfl 
AccI | 

Sfcl | I Fnu4HI Sfcl 

Plel I I I Mnll Bbvl Tsp509I Tsel I Mwol I 

Mill I I I I I I 

AGGTCTTTCCATTCCTGTAGACTCTTTAGGCGAGGGTAAGAAAGTTGTAATTGCTGCTTC 

+ + + + + + 

TCCAGAAAGGTAAGGACATCTGAGAAATCCGCTCCCATTCTTTCAACATTAACGACGAAG 



1620 



PstI 
Fnu4HI 1 
CviRI | | 



Hinfl 
Tfil 
Hpyl88IX | 
Avail I I 
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Mwol | | Ddel MboII I 1 BsaJI 

Tsel | I Bbvl CviJI |Sau96I I I Styl 

Ml t II III I 

TGCAGCAAGTAAAAATGTAGCCCTTAGTGGTCCGATTCTTCTTTTGGATAACCAAGGGAA 

1621 + + + + + + 1680 

ACGTCGTTCATTTTTACATCGGGAATCACCAGGCTAAGAAGAAAACCTATTGGTTCCCTT 

Alul 
CviJI 

Ddel Fnu4HI I 

Bce83I | Hpyl78III CviRI | | 

BsmI Hpyl78III | Smll | Tsel | | 

I II II II I 

TGCTTATGAAAATCACGACTTAGGAAAAACTCAAGACTTTTCATTTGTGCAGCTCTCTGC 

1681 + + + + + + 1740 

ACGAATAC TT T TAG T GC T GAAT CC x T T T T GAGT TC T G AAAAG T AAAC ACGT CG AG AG ACG 

Sfcl 

CviRI NlalV } 

Acelll Bsgl I Acil | | Cjel 

Bbvl I Rsall 1 Sfcl MspAlI I I Taal I 

11 Ml t I II II 

TCTGGGTACTGCAACAACTACAGATGTTCCAGCGGTTCCTACAGTAGCAACTCCTACGCA 

1741 + + + + + + 1800 

AGACCCATGACGTTGTTGATGTCTACAAGGTCGCCAAGGATGTCATCGTTGAGGATGCGT 

Cjel CacSI 
Rsal | Acil | 

II t I 

CTATGGGTATCAAGGTACTTGGGGAATGACTTGGGTTGATGATACCGCAAGCACTCCAAA 

1801 + + + + + + 1860 

GATACCCATAGTTCCATGAACCCCTTACTGAACCCAACTACTATGGCGTTCGTGAGGTTT 

BpulOI 
Ddel 
Hpyl78III 
Hinfl 

Avail Tfil 
Sau96I Hgal 
Alul | Hpyl88IX 
Tthlllll CviJI | BseMII I 

Ddel I Mwol | | CviJI | I 

11 III III 

G ACT AAG ACAG CG AC AT T AGCT T GG ACCAATACAGG C T ACC T TC CGAAT CC T GAGC GT C A 

1861 + + + + + + 1920 

CTGATTCTGTCGCTGTAATCGAACCTGGTTATGTCCGATGGAAGGCTTAGGACTCGCAGT 

Avail 

Eco0109I Dpnl 
PspSII BstYI | 

Sau96I Fokl |Hpyl88IX Hinfl 

Sse8647I Alol CviJI Sau3AI | Alwl | Tfil 

II I II II I 

AGGACC T T TAG T T CCTAAT AGCC T TT G GGG AT CT T T T T CAG AC AT CCAAGCG AT T CAAGG 

1921 + + + + + + 1980 

TCCTGGAAATCAAGGATTATCGGAAACCCCTAGAAAAAGTCTGTAGGTTCGCTAAGTTCC 

Mwol 
Hinfl 

Bbvl Acil 
TaqI Fnu4HI | 

Dpnl | CviJI | | 

Sau3AI I I Faul I I I 

Hpyl88IX| I | Sthl32I| I I I 
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1981 



Plel Hinfl Mnllll MCviJI \ I Tsel I I I 1 

i I Mill III Ml II 

TGTCATAGAGAGAAGTGCTTTGACTCTTTGTTCAGATCGAGGCTTCTGGGCTGCGGGAGT 

+ + . + + + + 2040 

AC AGT AT CT C T C T T C ACG AAAC T GAG AAAC AAGT C T AGCT CC GAAG ACC CG AC GCC C T CA 



Plel 

Tsp509I| Ddel Taal 

Ml I 
CGCCAATTTCTTAGATAAAGATAAGAAAGGGGAAAAACGCAAATACCGTCATAAATCTGG 

2041 + + + + + + 2100 

GCGGTTAAAGAATCTATTTCTATTCTTTCCCCTTTTTGCGTTTATGGCAGTATTTAGACC 



Hpyl88IX Tsp509I 
Mnll 1 Fnu4HI Bsgl I 

HaelVI I CviEUI Hpyl88IX I I Alul 

Hin4I| I TsellHhal Bbvl I I Msel I CviJI 

I t I lit III 1 I I 

T G G AT AT GCT AT C GG AG GTGCAG C GC AAACT T G T T C T G AAAACT TAAT T AGC T TTG CC T T 

2101 + + + + + + 2160 

ACCTATACGATAGCCTCCACGTCGCGTTTGAACAAGACTTTTGAATTAATCGAAACGGAA 



BstXI Ddel 

1 I 

TTGCCAACTCTTTGGTAGCGATAAAGATTTCTTAGTCGCTAAAAATCATACTGATACCTA 

2161 + + + + + + 2220 

AACGGTTGAGAAACCATCGCTATTTCTAAAGAATCAGCGATTTTTAGTATGACTATGGAT 



Mr* 
&4 



CviJI 

NlaIV| BsmAI 
CviRI I I Bsbl Mmel Ddel I 

111 I I M 

TGCAGGAGCCTTCTATATCCAACACATTACAGAATGTAGTGGGTTCATAGGTTGTCTCTT 

2221 + + + + + + 2280 

ACGTCCTCGGAAGATATAGGTTGTGTAATGTCTTACATCACCCAAGTATCCAACAGAGAA 

Cac8I 
Alul 

CviJI CviJI 
ScrFI | Fnu4HI I 

EcoRII | | Hinfl Plel Mnll Tsel I I 

111 I I I Ml 

AGATAAACTTCCTGGCTCTTGGAGTCATAAACCCCTCGTTTTAGAAGGGCAGCTCGCTTA 



2281 



-+ 2340 



TCTATTTGAAGGACCGAGAACCTCAGTATTTGGGGAGCAAAATCTTCCCGTCGAGCGAAT 

Bsu36I 
Ddel 
Hpyl7 8III 
Eco57I 
BseMII 

Maell ^3boII 
CviJI I Hpyl88lX BstZ17I| 

Acellll I Dpnl I AccI M 

Bbvl | | I Sau3AI I I Bbsl III ! Mnll I I BciVI 

(Ml III I 1 M I I 

TAGCCACGTCAGTAATGATCTGAAGACAAAGTATACTGCGTATCCTGAGGTGAAAGGTTC 

2341 + + + + + + 2400 

AT CGGTGCAGTCAT TACT AGACT TCTGT T TC ATATGACGCAT AGGACT CCACTTTCCAAG 



MboII Alul 

HphI Msel Nlalll 1 CviJI Hpyl78III 

I I I I I I 

TTGGGGGAATAATGCTTTTAACATGATGTTGGGAGCTTCTTCTCATTCTTATCCTGAATA 
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2401 + + + + + + 2460 

AACCCCCTTATTACGAAAATTGTACTACAACCCTCGAAGAAGAGTAAGAATAGGACTTAT 

Hpyl88IX 

BspMI Hinfl | Hpyl78III 
CviRI | BsaXI Tfil | Maell I 
•I 1 II II 
CCTGCATTGTTTTGATACCTATGCTCCATACATCAAACTGAATCTGACCTATATACGTCA 
2461 + + + + + + 2520 

GGACGTAACAAAACTATGGATACGAGGTATGTAGTTTGACTTAGACTGGATATATGCAGT 

Hpyl88IX Dpnl 
Alul | Bglll | 
CviJI | BstYI | MboII Earl 
Hin4I | Rsal Sau3AI | MboII | Tsp509I | 
ft I II II 11 
GGACAGCTTCTCGGAGAAAGGTACAGAAGGAAGATCTTTTGATGACAGCAACCTCTTCAA 
2521 + + + + + + 2580 

CCTGTCGAAGAGCCTCTTTCCATGTCTTCCTTCTAGAAAACTACTGTCGTTGGAGAAGTT 

Hpyl88IX 
Hin4I | 

Mnll Sfcl HphI | | BplI 

II III | 

T T TAT CT T TGC C T AT AGGGG T GAAGT T T GAG AAGT TCTC T GAT T G T AAT G AC T T T T CT T A 

2581 + + + + + + 2640 

AAAT AGAAACGGATAT CC CCACT TCAAACT CTTCAAGAG ACTAACAT TACT GAAAAG AAT 

BsrDI 
Dpnl | 

Hpyl88IX Dpnl Sau3AI | | 

Dpnl | Sau3AI | Alwl | | | Sfcl 

Sau3AI | | Hpyl78III| | Acil I | |) CviRI | 

III Ml I I III II 

TGATCTGACTTTATCCTATGTTCCTGATCTTATCCGCAATGATCCCAAATGCACTACAGC 

2641 + + + + + + 2700 

AC T AG ACT GAAAT AGGAT ACAAG GAC T AG AAT AGGCGT T AC T AG GGT T TAC GT G ATG T C G 

CviJI 

CviJI Hael 
NlalVI Haelll 
Acil | | StuI 
MspAlI | | Mnll Ddel Mwol | 

IN I I II 

ACTTGTAATCAGCGGAGCCTCTTGGGAAACTTATGCCAATAACTTAGCACGACAGGCCTT 

2701 + + + + + + 2760 

TGAACATTAGTCGCCTCGGAGAACCCTTTGAATACGGTTATTGAATCGTGCTGTCCGGAA 

BsrI 
CviJI | 

Maelll Haellll 
Tsp45l Eael 
Cac8I | Gdill 
CviRI | | Bsgl BsiHKAI I 
CviRI CacSI | | | Tthllllll Bspl286l| 
I III I II || 
GCAAGTGCGTGCAGGCAGTCACTACGCCTTCTCTCCTATGTTTGAAGTGCTCGGCCAGTT 
2761 + + + + + + 2820 

CGTTCACGCACGTCCGTCAGTGATGCGGAAGAGAGGATACAAACTTCACGAGCCGGTCAA 

Dpnl 
NlalV 

BamHI | Dpnl 
BstYI | Bglll t 
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Alwl 



Sau3AI I 



| | Alwl Mnll 



BstYI | 
Sau3AI | 
I I 



Hgal 
Tsp509l 



TGTCTTTGAAGTTCGTGGATCCTCACGGATTTATAATGTAGATCTTGGGGGTAAGTTCCA 

+ + + + + + 

ACAGAAACTTCAAGCACCTAGGAGTGCCTAAATATTACATCTAGAACCCCCATTCAAGGT 



Dpnl 
Sau3AI i 
BseMII | | 
Ddel | | I 
Hpyl88IX | | I 
BplI I lit 
Apol II II! 
Tsp509I li III 
BsmAI III III 
Hpyl8 8lX| III III 
BseMIIi I I I I I II 
Ddel I I I I II I II 
Nlalll | I I I I II I II 
BsmAI I I II I I II t II 



Bfal 
BslI 



BsmBI 



Hin4I 



Msel 



ATTCTAGGAGCGTCTCTCATGTCTCAGAAATTCTGAGAGAGATCGCATTTAGGATTTTCT 

' i I — — — .-t- 



+ 2940 



2881 



TAAGATCCTCGCAGAGAGTACAGAGTCTTTAAGACTCTCTCTAGCGTAAATCCTAAAAGA 



TAAACACGAC 



2941 



+ 2950 



ATTTGTGCTG 
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